FIGURE l 



GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGTCTGAAGACTAACATTTTGTGAAGTTGTAAAACAGAAAACCTGTTAGAAATGTGGTGGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
TCATACATTACTGCAGTAACACTCCACCATATAGACCCGGCTTTACCTTATATCAGTGACACTGG 
TACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGGCAGTTTTATGCATTG 
C T ACCAT T TATGT T CGT T AT AAGC AAGT T CAT GC TC TGAGT CC T GAAGAGAACGT TAT CAT CAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
C C AGAAAACAACCC T T T T T GC TGCACAT GTAAGTGGAGC TGT GCT T ACC T T T GG TATGGGCT CAT 
TATATAT GTT TGTT CAGACCAT CC T T TCC TACCAAATGCAGCCCAAAAT CCAT GGCAAACAAGT C 
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
ATCAGTTTTGCACAGTGGCAATTTTGGGACTGATTTAGAACAGAAACTCCATTGGAACCCCGAGG 
ACAAAGGT TATGTGCTT CACATGAT CAC TACT GCAGCAGAATGGTC TAT GT CAT TTTCCTTCTTT 
GGT T T TT T CCT GAC T TACAT T CGTGATT T T CAGAAAAT TT CT T TAC GGGT GGAAGCCAAT T TACA 
T GGAT TAACC C TC TAT GACAC T GCACCT T GCCC TATTAACAATGAACGAACACGGCTACT T T CCA 
GAGATAT T TGATGAAAGGATAAAATAT TT CTGTAATGAT TATGAT TCTCAGGGAT TGGGGAAA.GG 
T T C AC AGAAGT T GC T TAT T C T T CT CT GAAAT T T TCAACCACT T AATCAAGGC T GACAGTAACACT 
GAT GAAT GCT G AT AAT C AGGAAAC AT G AAAG AAGC CAT T T GAT AGAT TAT T C T AAAG GAT AT CAT 
CAAGAAGACTATTAAAAACACCTATGCCTATACTTTTTTATCTCAGAAAATAAAGTCAAAAGACT 

ATG 



Ml 



FIGURE 2 

<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX(S/T): 0 

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV 
LC I AT I YVRYKQVHALS PEENVI I KLNKAGLVLG I LSCLGLS I VANFQKT TLFAAHVSGAVLT FG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHM I TTAAEWSMS FS FFGFFLTY IRDFQKI SLRVEANLHGLTLYDTAPC PINNERTR 
LLSRDI 

Important features : 

Type II transmembrane domain: 

amino acids 13-33 

Other Transmembrane domains : 

amino acids 54-73, 94-113, 160-180, 122-141 

N-myristoylation sites. 

amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
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FIGURE a 



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTTCGCCATGAGTTTCCTCATCGACTCCAGCATCATGATTACCTCCCAGATACTATTTTTTG 
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGA 
CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
CATTCTCAGCCCAAAACAT GGGATCTT ATCCATAGAACAGCTCAT CAGCCG GGTT GGTGT GATT GGAG 
TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
CTCAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATATGAT 
CATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCAGAAGGGGGAAGTGCATAACA 

AACCATCAGGTT TCT GGGGAATGAT AAAAAGT GTTACCACT TCAGCAT CAGGAAGTGAAAATCTTACT 

CTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGCTTTTTCTGGAAACAGCTGATCT 

ATAT GCTACCAAGGAGAGAATAGAATACT CCAAAACCTT CAAGGGGAAATAT TTTAATTTTCTTGGTT 

ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 

GGGAAAAC GGAT CCT GTCACAAGAGGCAT TGAGATCACT GT GAATTAT CTGGGAATCCAAT TT GATGT 

GAAGTTTTGGTCCCAACACATTTCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGAT 

TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 

CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 

AGAATACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG 

ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 

GAGAAGCAAATGGCACCTTGAACTTAAGCCTACTACAGACTGTTAGAGGCCAGTGGTTTCAAAATTTA 

GAT AT AAGAGGGGGGAAAAAT GGAAC CAGG G CC T GACAT T T TATAAAC AAACAAAAT GC T AT GGTAGC 

ATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATACTATGACCATGAGTAGCATCAGCCAG 

AACAT GAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCAT CCCGT GTGGATAT GAGGCTGG 

TGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 

CTAT GGTAGCTGAGCCAAACACGTAGGATTTCCGTT T TAAGGT T CACAT GGAAAAGGTTATAGCTTTG 

CCTTGAGATTGACTCATTAAAATCAGAGACTGTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCG 

ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
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FIGURE 4 



MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYWQVIFSVTFAFSCTMFELIIFEILGV 
LNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFFWKLGDP 
FP I LS PKHGI LS I EQLI SRVGVIGVTLMALLSGFGAVNCPYT YMS YFLRNVTDTDI LALERRLLQ 
TMDMIISKKKRMRMARRTMFQKGEVHNKPSGFWGMIKS 

LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCWKIFMATINIVFDRVGKTDPVTRGIEI 
TVNYLGI QFDVKFWS QHI S FI LVG 1 1 1 VT S IRGLLI TLTKFFYAI S S SKSSNVI VLLLAQIMGMY 
FVSSVLLIRMSMPLEYRTIITEVLGELQFNFYHRWFDVIFLVSALSSILFLYLAHKQAPEKQMAP 



amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 



Important features: 
Signal peptide: 



If! 



amino acids 1-23 



Potential transmembrane domains: 




N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 



425-444 



Eukaryotic cobalamin -binding proteins 



amino acids 151-160 
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FIGURE 5 

AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GT CATGCAGAATT ATGGGGAT CACCCT T G T GAGCAAAAAGGCGAACCAGCAGC T GAAT T T CACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GC CGACAGT T TGCAGCC TATTGTTACAAC TCATCTGATACT T GGACTAACT CGT GCAT T CCAGAA 
AT TATCAC CACCAAAGATCCCAT AT T CAACACT CAAACT GCAACACAAACAACAGAAT T TAT T GT 
CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
CTGCTCCAGCTTCCACTTCTATTCCACGGAGAAAAAAATTGATTTGTGTCACAGAAGTTTTTATG 

GAAAC TAG CAC CAT G T C T AC AGAAAC T GAAC CAT T T G T T GAAAAT AAAG CAGCAT T CAAGAAT GA 
AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT 
CAGCAGAAGGAAATGATCGAAACCAAAGTAGTAAAGGAGGAGAAGGCCAATGATAGCAACCCTAA 
T GAGGAATCAAAGAAAAC TGATAAAAAC CCAGAAGAGTCCAAGAGT CCAAGC AAAAC TACCGT GC 
GAT GCCT GGAAGCT GAAGT T TAGATGAGACAGAAAT GAGGAGACACACCT GAGGCTGGT T T C T T T 
CATGCTCCTTACCCTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 

CCC T TGGT T CC TAAC TGGAAT CAGC T CAGGACT GC CATTGGAC TATGGAG T GCACCAAAGAGAAT 
GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
TTCTAGCCTGGCTATGTCCTAATAATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAAGGAC 
CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
AAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA 
GC T C TGAAAGAGAAACACGTAT CCCACCT GACAT GTC CT TCTGAGCC CGGT AAGAGCAAAAGAAT 
GGCAGAAAAGT T TAGCCCCTGAAAGCCAT GGAGAT TC TCATAAC T TGAGACC TAATC TC TG TAAA 
GCTAAAATAAAGAAATAGAACAAGGC T GAGGAT ACGACAGTACACTGT CAGCAGGGAC T GTAAAC 
ACAGACAGGGTCAAAGTGTTTTCTCTGAACACATTGAGTTGGAATCACTGTTTAGAACACACACA 
CTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTCTCTAGGAAATATACTTTTACAAGTAACA 
AAAATAAAAAC T C TTAT AAAT T T C TAT T T T TAT CTGAGT TACAGAAAT GAT T ACTAAGGAAGAT T 
ACTCAGTAATT TGTT TAAT^AAGTAATAAAAT TCAACAAACATTTGCTGAATAGCTACTATAT GTC 
AAGT GCT GT GCAAGGTAT TACAC TC TGTAAT TGAATAT TAT T CC T CAAAAAAT TGCACATAGTAG 
AACGCTATCTGGGAAGCTATTTTTTTCAGTTTTGATATTTCTAGCTTATCTACTTCCAAACTAAT 
T T T TAT TT T T GC TGAGAC TAATC T TAT TCAT T T TC T CT AATATGGCAACCAT TATAACCT TAATT 
T ATTAT TAACATACC T AAGAAGTACAT T GT TACC TCTATAT ACCAAAGCACAT T T T AAAAGT GCC 
ATTAACAAATGTATCACTAGCCCTCCTTTTTCCAACAAGAAGGGACTGAGAGATGCAGAAATATT 

TGTGACAAAAAAT TAAAGCAT TTAGAAAACT T 



FIGURE 6 



MARCFSLVLLLTS IWTTRLLVQGSLRAEELS IQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
L S LAGKDQVE T ALKAS FE T C S YGWVGDG FW I S R I S PNPKCGKNGVGVL I WKVPVS RQFAAYC YN 
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKLICVTEVFMETSTMSTETEPFVENKAAFKNEAAGFGGVPTALLVIALLFFGAAAGLGFCYVK 
RYVKAFP FTNKNQQKEMI E TKVYKEEKANDSNPNEE SKKT DKNPEESKS PSKT TVRCLEAEV 



a , Signal sequence: 

P'i amino acids 1-16 



ff! 



13 
tfi 



Transmembrane domain: 

amino acids 235-254 



N-glycosylation site. 



amino acids 53-57, 130-134, 289-293 



Casein kinase II phosphorylation site. 

W 

n* amino acids 145-149, 214-218 

Pj Tyrosine kinase phosphorylation site. 

amino acids 79-88 



N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 
GGAGGACACGCAGCACAAAT T GCGCAGC GCGGT GGAAGAGAT GGAGGCAGAAGAAGCT GC T GC TA 
AAGCATCATCAGAAGTGAACCTGGCAAACTTACCTCCCAGCTATCACAATGAGACCAACACAGAC 
ACGAAGG T TGGAAATAATACCATCCATGT GCACCGAGAAATT CACAAGATAACCAACAACCAGAC 
TGGACAAATGGTCTTTTCAGAGACAGTTATCACATCTGTGGGAGACGAAGAAGGCAGAAGGAGCC 
,^ ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 
H ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
y GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
Cf ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
SP ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
||| CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
||| CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
GAGATCCTGCTGCCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG 
CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 
CCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAA 
TAGAAA.TAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
CATCTTCTTCCCAGTAAGTTTCCCGTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 
TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC 
P TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 

CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 
AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCT 
C TCCAC T ACCCCACACCAGCCT TGGTGCCACCAAAAG TGC T CCCCAAAAGGAAGGAGAATGGGAT 
TTTTCTTGAGGCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGT7\AA 
C TAC TGTT AGGAACAGCAGT GT TC TCACAGTGT GGGGCAGCCGT CCTT CT AATGAAGACAAT GAT 
ATTGACACTGTCCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCA 
TACAGGT T AACC TGCAGAAACAG TACT TAGGTAATT GTAGGGCGAGGAT TAT AAATGAAAT T TGC 
AAAATCACT TAGCAGCAACTGAAGACAAT TATCAACCACGTGGAGAAAATCAAACCGAGCAGGGC 
TGTGTGAAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCACAAATGATG 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 
CAT GAT T GT ATAAGCAT GCT T TC T TTGAGT T TTAAAT TATG TATAAACAT AAGT TGCAT T T AGAA 
AT CAAGCAT AAAT CAC T TCAAC T GCAAAAAAAAAAAAAAAAAAAAAAAAAA 



III 
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FIGURE 8 



MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RSAVEEMEAEEAAAKASSEWIANLPPSYHNETNTDTKVGNNTIHVHREIHKITNNQTGQMVFSE 
TVITSVGDEEGRRSHECIIDEDCGPSMYCQFAS FQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGSFMEEVRQELEDLE 
RSLTEEMALGEPAAAAAALLGGEE I 



Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 
327-331 




Signal sequence: 



N-glycosylation site. 

amino acids 96-100, 106-110, 121-125, 204-208 



amino acids 1-19 



iii 



N-myristoylation site. 

amino acids 202-208, 217-223 



Amidation site. 



amino acids 140-144 
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FIGURE Q 



CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 

GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 

GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 

AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 

CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 

ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 

GAGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGCCCGCTGCATCATGTGCTGTTT 

CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGA 

TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 

ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 

GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 

ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 

GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 

GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 

AGATTCTGGGCAAGAAGAACGAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTCCGG 

CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 

CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 

TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 

AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 

GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 

GATCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAACAAAACAAACAAACAAA 

AAG AT T T TAT T AAAGAT AT T T T G T T AAC T C 



FIGURE 10 



RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLFWTL 
NWVLALGQCVLAGAFAS FYWAFHKPQDI PTFPL I SAFIRTLRYHTGSLAFGAL ILTLVQIARVIL 
E Y I DHKLRGVQNPVARC IMCC FKCCLWCLEKF I KFLNRNAY IMIAI YGKNFCVS AKNAFMLLMRN 
IVRVWLDKVTDLLLFFGKLLWGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY 
VIASGFFSVFGMCYDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

Important features: 
Transmembrane domains: 

amino acids 57-80 (type II), 110-126, 215-231, 254-274 

N-glycosylation sites. 

amino acids 16-20, 27-31, 289-293 

Hypothetical YBR002c family proteins. 

amino acids 276-288 

Ammonium transporters proteins. 

amino acids 204-231 

N-myristoylation sites. 

amino acids 60-66, 78-84 

Ami da ti on site. 

amino acids 306-310 
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FIGURE 11 

GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
H ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
U CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 

M GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 

§71 

T] TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 
iff CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 
jg CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 
^1 TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 
* G GGC AAC G AGACAGT T GT G GCAG GCCC CGAGG G CT AT GAGACC CAGT GG T G GGAT GC CC CGAGCAT T G 

TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 

m 

^ AACAGCCTGATGCAGACCGAGGAGTGCCCACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 
J| f GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 
p ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 
f|| ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 

CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 
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FIGURE 12 

MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 

YKLPWCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ 

NGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSWNQRWLGKAE 

ECDSRAWYAGLFFFTLLFYLLSIAAVAIjMFMYYTEPSGCHEGKVFISLNLTFCVCVS iaavlpkv 

QDAQPNSGLLQASVI TLYTMFVTWSALS S I PEQKCNPHLPTQLGNETWAGPEGYETQWWDAPS I 

VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 

SFFHFCLVLASLHVMMTLTNWYKPGETRKMISTO 

FS 

Signal sequence: 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
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FIGURE 13 

CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGTTGGAAAAAGACTCCTGTAACCCTCCTCCAGG ATGA ACCACCTGCCAGAAGACATGGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
CTCATGGCCAGGAT TGAGTCC TAT GAAGGAAGGGAAAAGAAAGGCATAT C TGAT GT CAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
T GAATGGAGGCAT TGAGAACACATTAGAGAAGGAGGT GATGCAGTATGAC T AC TAT TC TT CATAT 
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 
ACTGCGCCAT TGGTGGGCAATAGCGT TGACAACGGCAGTGACCAGTGCCT TTT TACTAGCAAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
C T TGCCT GGAT TGAGACGT GGT TCC T GGAT T TCAAAGT GT T ACC TCAAGAAGCAGAAGAAGAAAA 
CAGACTCCTGATAGTTCAGGATGCTTCAGAGAGGGCAGCACTTATACCTGGTGGTCTTTCTGATG 
GTCAGTTTTATTCCCCTCCTGAATCCGAAGCAGGATCTGAAGAAGCTGAAGAAAAACAGGACAGT 
GAGAAACCACTT TTAGAACT ATGAG TACTACTTTTGT TAAAT GTGAAAAACCCTCACAGAAAGTC 
ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCTGTCGACAGTAAAGTTGAAATGGTGACGTC 
CACT GCTG GCT T TAT T GAACAGC T AATAAAGAT T TATT TAT T G TAATACCT CACAAACGT T G TAC 
CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
T CAG TGAGAC TGAGCC TGATGTGTTAACAAATAGGT GAAGAAAGTC T TGTGCTGTAT TCCTAATC 
AAAAGAC T T AATATAT T GAAGTAACACT T T T T TAGTAAGCAAGATACCT T T TTAT TTCAAT T CAC 
AGAATGGAATTTTTTTGTTTCATGTCTCAGATTTATTTTGTATTTCTTTTTTAACACTCTACATT 
TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
C T TGCAAAAGGGGAAGAAAGGAAT TGCGAATACATGTAAAATGT CACCAGACAT T TGTAT TATT T 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
ACAAAAT GACT TAAACCAT TCATATCATGTTTCCTTT GCGT TCAGCCAATT TCAATTAAAATGAA 
C TAAAT T AAAAA 
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FIGURE 14 



MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVTFDLLF 
VTLLWIIELNVNGGIENTLEKEVMQYDYYSSYFDI FLLAVFRFKVL I LAYAVCRLRHWWAIALTT 
AVTSAFLIJ^C^ILSKLFSQGAFGWLPIISFIIAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 



Transmembrane domains: 



amino acids 54-72, 100-118, 130-144, 146-166 




N-myristoylation sites. 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 



fU 
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FIGURE 15 

ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 

CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 

GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 

CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 

CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 

AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 

TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 

CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 

CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 

AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 

CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 

GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 

TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 

CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 

TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 

TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 

TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 

CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 

CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 

TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 

CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 

ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 

GGC C T GTAC T G T GAGAG C CAGAT GGGG CAG GG GAC AC GG C CC AGC C C TACAC CAG TC AC G C C GAG 

GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 

AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 

GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 

GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 

GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 

CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 

GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 

ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 

CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 

ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 

GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 

ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 

GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 

CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 

CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 

TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 

GGCTGTGTGAC TC TAG T CT T GGCCCCAGGAAGCGAAGGAACAAAAGAAACT GGAAAGGAAGAT GC 

T T T AGGAACAT GT TT TGCT T T T T T AAAAT ATATATAT T TAT AAGAGAT CCT T TCCCAT T TAT TCT 

GGGAAGAT GT T T T TCAAACT CAGAGACAAGGAC T T TGGT T T TTGTAAGACAAACGATGAT AT GAA 

GGCCTTTTGTAAGAAAAAATAAAAGATGAAGTGTGAAA 



FIGURE 16 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGS FAGLPGLQLLDLS QNQ I AS LPS GVFQPLANLSNLDLTANRLHE I TNET FRGLRRLERL Y 
LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEPGILDTANVE 
ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGLAALQELDVSNLSLQALPGDLSGLFPRLRLIAAARNPFNCT 

TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSLAPTWLSPTAPATEAP 

SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 

VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 

TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAYHSNHAPVTQAREGNLPLLIAPALAAV 

LLAALAAVGAAYCWRGRiMAAAAQDKGQVGPGAG 

CEVPIjMGFPGPGLQSPLHAKPYI 

Important features: 
Signal peptide: 

amino acids 1-23 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature. 

amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites . 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 

GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCATGCG 

GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 

AT GAAGAAGGCAGCCAGGAT GAAT CC T TAGAT TCCAAGACTAC TT TGACATCAGAT GAGT CAGTA 

AAGGACCATACTACTGCAGGCAGAGTAGTTGCTGGTCAAATATTTCTTGATTCAGAAGAATCTGA 

ATTAGAATCCTCTAT TCAAGAAGAGGAAGACAGCC TCAAGAGCCAAGAGGGGGAAAGTGTCACAG 

AAGATATCAGCTT TCTAGAGTCTCCAAAT CCAGAAAACAAGGACTATGAAGAGCCAAAGAAAGTA 

CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 

T T T C CT AGAT AAGGAG TAT GAT GAAT GT ACAT C AGAT G G GAG GGAAGAT G GC AGAC TGTGGTGTG 

CTACAACCTATGACTACAAAGCAGATGAAAAGTGGGGCTTTTGTGAAACTGAAGAAGAGGCTGCT 

AAGAGAC GGCAGATGCAGGAAGCAGAAAT GAT G TAT CAAAC T G GAAT GAAAAT CC T TAAT GGAAG 

CAATAAGAAAAGCCAAAAAAGAGAAGCATATCGGTATCTCCAAAAGGCAGCAAGCATGAACCATA 

CCAAAGCCCTGGAGAGAGTGTCATATGCTCTTTTATTTGGTGATTACTTGCCACAGAATATCCAG 

GCAGCGAGAGAGAT GT T T GAGAAGC TGAC T GAGGAAGGCT CT CCCAAGGGACAGAC TGCT CTT GG 

CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 

TTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTTTAGTGGAAGGCT 

AATAATATTAACATCAGAAGAATTTGTGGTTTATAGCGGCCACAACTTTTTCAGCTTTCATGATC 

C AG AT T T GC T T G TAT T AAGAC C AAAT AT T C AG T T GAAC T T C C T T C AAAT T C T T G T TAAT GG AT AT 

AACACATGGAAT CTACAT GTAAATGAAAG T TGGTGGAGT CCACAATT T T T CT T TAAAAT GAT TAG 

TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 

GAAT TGT CAGAATCATT T T TTACAT TAGAT TAT CATAAT T T TAAAAAT T T T TC TT TAGTT T T TCA 

AAAT T T T GT AAATGGT GGCTATAGAAAAACAACATGAAATAT TATACAAT AT T T TGCAACAATGC 

CCTAAGAATTGTTAAAATTCATGGAGTTATTTGTGCAGAATGACTCCAGAGAGCTCTACTTTCTG 

TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 

GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 

AAGAT T AAC T CAT T T T T AAT AAAAT TAT G T C T AAGAT T AAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 18 



mrvrigltlllcavllslasassdesgsqdesldskttltsdesvkdhttagrwagqifldseesel 
essiqeeedslksqegesvtedisflespnpenkdyeepkkvrkpaltaiegtahgepchfpflfldk 
eydectsdgredgrlwcattydykadekwgfceteeeaakrrqmqeae1nmyqtgmkilngsnkksqkr 
eayrylqkaasmnhtkalervsyallfgdylpqniqaaremfeklteegspkgqtalgflyasglg™ 
s s qaka1vy yt fgalggnl i ahmvlvs rl 



Signal peptide 



sis 



Important features: 
Signal peptide: 

amino acids 1-21 



N-glycosylation sites. 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myristoylation sites. 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 



f|| Glycosaminoglycan attachment site. 

amino acids 2 67-271 



Microbodies Oterminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen-binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE 1Q 



AAT T CAGAT T T TAAGCCCAT TC TGCAGT GGAATT T CAT GAAC TAGCAAGAGGACACCAT CT TCT T 

GT A T T AT ACAAGAAAGGAGTGTACC TAT CACACACAGGGGGAAAAATGCT CTT T TGGGTGCT AGG 

CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 

ATAAGTACATTTTTATCACTGGATGTGACTCGGGCTTTGGAAACTTGGCAGCCAGAACTTTTGAT 

AAAAAGGGAT T TCAT GTAATCGCT GCCT GT CT GACTGAAT CAGGAT CAACAGC T TTAAAGGCAGA 

AACCTCAGAGAGACTTCGTACTGTGCTTCTGGATGTGACCGACCCAGAGAATGTCAAGAGGACTG 

CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 

CCCGGCGTGCTGGCTCCCACTGACTGGCTGACACTAGAGGACTACAGAGAACCTATTGAAGTGAA 

CC TGT TT GGAC TCAT CAGT GTGACACTAAATAT GC T T CC T T T GGTCAAGAAAGC TCAAGGGAGAG 

TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 

TATGCAGTGGAAGGTTTCAATGACAGCTTAAGACGGGACATGAAAGCTTTTGGTGTGCACGTCTC 

ATGCAT T GAAC CAGGAT TGTT CAAAACAAACT TGGCAGAT CCAGTAAAGG TAAT T GAAAAAAAAC 

TCGCCATTTGGGAGCAGCTGTCTCCAGACATCAAACAACAATATGGAGAAGGTTACATTGAAAAA 

AGT C TAGACAAACT GAAAGGCAAT AAAT CC TAT GT GAACAT GGACC TC TCT CCGGTGGT AGAGT G 

CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA 

TTTTCTGGATACCTCTGTCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAA 

GCAGAGC T GGC TAAT CCCAAGGCAGT GTGACTCAGCTAACCACAAAT G TCT CCT CCAGGCTATGA 

AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 

CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 

CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 

GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 

TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 

ATCATCTCTTATCTAAATATTAAAAGATAAGTCAACCCAAAAAAAAAAAAAAAAAAAAAAAAi^ 

AAAAAAAAAAAAA 



FIGURE 20 



MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 
STALKMTSERLRTVLLDVTDPENVKRT^^ 

REPIEWLFGLISVTLNMLPLVKKAQGRVINVSSVGGRIAIVGGGYTPSKYAVEGFNDSLRRDMK 
AFGVHVS C I E PGLFKTNLADPVKVI EKKLAI WE QLS PD I KQQYGEGY IEKS LDKLKGNKS YVNMD 
LSPVVECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQKAELANPKAV 



m 

s 

O 
m 



in 
0 
ni 



Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites. 

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site. 

amino acids 39-42 

N-myristoylation sites . 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 



CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CAC T CGC T T T CCAGCACCTCAACACGGACTCGGACACGGAAGGT TT TCT T CT TGGGGAAGTAAAA 
GGT GAAGCCAAGAACAGCAT T AC TGAT TCCCAAAT GGAT GATGTT GAAGT T GT T TAT ACAAT T GA 
CATTCAGAAATATATTCCATGCTATCAGCTTTTTAGCTTTTATAATTCTTCAGGCGAAGTAAATG 
AGCAAGCACTGAAGAAAATATTATCAAATGTCAAAAAGAATGTGGTAGGT TGGTACAAATTCCGT 
CGTCATT CAGATCAGATCATGACGT TTAGAGAGAGGCTGCT TCACAAAAACTTGCAGGAGCATT T 
T TCAAACCAAGACC T TGT T TT TCTGC TAT TAACACCAAGTATAATAACAGAAAGC TGCTCTACT C 
ATCGACTGGAACAT TCCTTATATAAACCTCAAAAAGGACTTTTTCACAGGGTACCTT TAGT GGTT 
GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
TTT TAGCCGAGCAGTACAAACACACAGCTCTAAATTT TT TGAAGAAGATGGATCCTTAAAGGAGG 
TACATAAGATAAATGAAATGTATGCT TCAT TACAAGAGGAAT TAAAGAGTATATGCAAAAAAGTG 
GAAGACAGTGAACAAGCAGTAGATAAACTAGTAAAGGATGTAAACAGATTAAAACGAGAAATTGA 
GAAAAGGAGAGGAGCACAGAT T CAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCC T CAGGAGA 
ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
ATGTCTTTAAAAAATAGACATGTTTCTAAAAGTAGCTGTAACTACAACCACCATCTCGATGTAGT 
AGACAATC TGACCT TAATGGTAGAACACACTGACAT T CCT GAAGCTAGTCCAGC TAGTACACCAC 
AAATCAT TAAGCATAAAGCCT TAGACT TAGATGACAGAT GGCAAT TCAAGAGATC TCGGTTGTTA 
GATACACAAGACAAACGATCTAAAGCAAATACTGGTAGTAGTAACCAAGATAAAGCATCCAAAAT 
GAGCAGCCCAGAAACAGATGAAGAAATTGAAAAGATGAAGGGTT TTGGTGAATAT TCACGGTCTC 
CTACAT TT TGATCC T T TTAACC T TACAAGGAGAT TTT T TT AT TT GGC T GATGGGTAAAGCCAAAC 
ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
TGTT TGCAGTAATACACAGATAAC TC T TAGT GCAT T T ACT T C ACAAAGTAC T T T T T CAAACATCA 
GATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCTTACACAG 
ACACAT TCTT TAGAAT TGGAAAAGTGAGACCAGGCACAGT GGCTCACACCTGTAATCCCAGCACT 
TAGGGAAGACAAGTCAGGAGGAT TGAT T GAAGC TAGGAGT T AGAGACCAGCCTGGGCAACGTATT 
GAGACCATGTCTATTAAAAAATAAAATGGAAAAGCAAGAATAGCCTTATTTTCAAAATATGGAAA 

G AAAT T TAT AT GAAAAT T TAT C T GAG T CAT T AAAAT T C T C C T T AAG T GAT AC T T T T T T AG AAG T A 
CATTATGGCTAGAGTTGCCAGATAAAATGCTGGATATCATGCAATAAATTTGCAAAACATCATCT 
AAAATTTAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 22 



MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEWYTIDIQKYI 
PCYQLFSFYNSSGEVNEQALKKILSNVKKNWGWYKFRRHSDQIMTFRERLLHKNLQEHFSNQDL 
VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
QI QAAREKNI QKDPQENI FLCQALRT FFPNSE FLHSCVMS LKNRHVSKS S CNYNHHLDWDNLTL 
MVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 
DEE I EKMKG FGEYSRS PT F 

Important features: 
Signal peptide: 
amino acids 1-19 

N-glycosylation sites. 

amino acids 75-79, 322-326 

N-myristoylation site. 

amino acids 184-154 

Growth factor and cytokines receptors family. 

amino acids 134-150 
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FIGURE 23 



GGCACAGC CGCGCGGCGGAGGGCAGAGT CAGC C GAGC C GAGT C CAGCC GGAC GAGCGGAC CAGCGCAGGGCAGCCCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
AACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATCATTTGAAGATCTGTCCCCAGGGTTCTACCTGCTGCTCT 
CAAGAGAT GGAGGAGAAGTACAGCCT GCAAAGT AAAGAT GATTTCAAAAGT GT GGT CAGCGAACAGT GCAAT CATTT G 
CAAGCT GT CTTTGCTTCACGTTACAAGAAGTTT GAT GAAT T CTTCAAAGAACT ACTT GAAAAT GCAGAGAAAT CCCT G 
AAT GATAT GTT T GTGAAGACATAT GGC CAT TT ATACAT GCAAAATT CT GAGCTATT TAAAGAT CT CTT CGTAGAGT T G 
AAACGTTACTAC GTGGT GGGAAATGT GAACCTGGAAGAAAT GCTAAAT GACTT CT GGGCT CGC CT CCT GGAGCGGAT G 
TT CCGCCTGGTGAACT CCCAGTACCACTTTACAGATGAGTATCTGGAATGTGT GAGCAAGTATACGGAGCAGCTGAAG 
CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
TTAGCGGTTGCGGGAGATGTCGTGAGCAAGGTCTCCGTGGTAAACCCCACAGCCCAGTGTACCCATGCCCTGTTGAAG 
AT GAT CT ACT GC T CC CACTGCCGGG GTCTC GT GACTGT GAAGC CAT GT TACAACTACT GCT CAAACAT CAT GAGAGGC 
T GTTT GGC CAACCAAGGGGAT CT CGATTTT GAAT GGAACAATTTCATAGAT GCTAT GCT GATGGT GGCAGAGAGGCT A 
GAGGGT CCT TT CAACATT GAAT CGGT CAT GGAT C CCAT CGAT GTGAAGATT T CT GATGCTATTAT GAACAT GCAGGAT 
AATAGTGTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACGAATTTCTCGT 
TCCATCTCTGAAAGTGCCTTCAGTGCTCGCTTCAGACCACATCACCCCGAGGAACGCCCAACCACAGCAGCTGGCACT 
AGTTT GGAC C GACT GGTTACT GAT GT CAAGGAGAAACT GAAACAG GCCAAGAAATT CT GGT CCT C CCT T C CGAGCAAC 
GTTTGCAACGATGAGAGGATGGCTGCAGGAAACGGCAATGAGGATGACTGTTGGAATGGGAAAGGCAAAAGCAGGTAC 
CTGTT T GCAGT GACAGGAAAT GGATTAGCCAACCAGGGCAACAACC CAGAG GT C C AGGTT GACACCAGCAAACCAGAC 
ATACT GAT CCTT CGT CAAAT CAT GGCTCTT C GAGT GAT GAC CAGCAAGAT GAAGAATGCATACAAT GGGAACGAC GTG 
GACT T CTT T GATAT CAGT GAT GAAAGTAGT GGAGAAGGAAGTGGAAGT GGCTGT GAGTATCAGCAGT GC C CTT CAGAG 
TTT GACTACAAT G CCACT GAC CAT GCTG GGAAGAGT GCCAATGAGAAAGC CGACAGTGCT GGT GT CCGT CCT GGGGCA 
CAGGC CTACCT CCT CACT GT CTT CT GCAT CTTGT T CCT GGTTAT GCAGAGAGAGT GGAG ATAAT T CT CAAACT CT GAG 
AAAAAGT GTT CAT CAAAAAGT TAAAAGGCAC CAGT TAT CACTTTT CTAC CATCCTAGT GACTT T GCTT T TT AAAT GAA 
T GGACAACAAT GTACAGTTTTTACT ATGT GGCCACT GGTTTAAGAAGT GCT GACTT TGT TT TCT CATT CAGTTT T GGG 
AGGAAAAG GGACT GTGCATT GAGTT GGTT CCTGCT C CCC CAAACCAT GT TAAACGT GGCT AACAGT GTAGGTACAGAA 
CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTCATTTCGTTTGTGGGTT 
TTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAGGGTCCCTTCTTGGCACGTAACATGTACGTATT 
T CT GAAATATTAAATAGC T GTACAGAAGCAGGT TT TAT TTAT CAT GTT ATCTTAT TAAAAGAAAAAGCCCAAAAAGC 



FIGURE 24 

MARFGLPALLCTLAVLSAALLAAELKSKSCSEVRRLYVSKGFNKNDAPLHEINGDHLKICPQGST 
CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 
LYMQNSELFKDLFVELKRYYWGNVNLEEMLNDFWARLLERMFRLTOSQYHFTDEYLECVSKYTE 
QLKPFGDVPRKLKLQVTRAFVAARTFAQGIAVAGDWSKVSVVNPTAQCTHALLKMIYCSHCRGL 
VTVKPCYNYCSNIMRGCIANQGDLDFEWNNFIDAMLMVAERLEGPFNIESVMDPIDVKISDAIMN 
MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
EKLKQAKKFWSSLPS1WCNDERWAAGNGNEDDCWNGKGKSRYLFAVTGNGIANQGNNPEVQVDTS 
KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 
KSANEKADSAGVRPGAQAYLLTVFCILFLVMQREWR 

Important features: 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 515-524 

N-glycosylation site. 

amino acids 514-518 

Glycosaminoglycan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 



CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
CTGAGTCATCCCCAGGGATCAGGAGCCTCCAGCAGGGAACCTTCCATTATATTCTTCAAGCAACT 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAAAGATTGGTTCCTGAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
TCAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCATTCCAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTGTAGGAGCTCTG 

AGCGCCCACT C T T CCAAT T AAAC AT T C T C AGCC AAGAAGACAGT GAGCACACC TACC AGAC AC TC 
TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGC TAG T G T CAT T T AAC C T T AAAT GCAAT C AG GAAAG TAG C AAAC AGAAG T C AAT AAAT AT T T T T 
AAAT GT CAAAAAAAAAAAAAAAAAA 
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FIGURE 26 

MKVL I S S LLLLLPLMLMSMVS S SLNPGVARGHRDRGQASRRWLQEGGQE CE CKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features: 
Signal peptide: 

amino acids 1-22 

N-myristoylation sites. 

ri amino acids 27-33, 4 6-52 

■S3!? 
ff I 

Irs 
%? ? 

. k 



ill 

5 
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FIGURE 27 

GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 

AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 

CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 

TGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGCTCTTCAGCCAG 

ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 

CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 

AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 

GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 

CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 

TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 

GAGCCCACCTGGAAACACATTGGGGATGGCTGCTGCCTCACCAGAGAGACCTGGAAGGATCTTGA 

GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 

GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 

TTCCCCAGCCTCCAATTAGAACAAGCCACCCACCAGCCTATCTATCTTCCACTGAGAGGGACCTA 

GCAGAATGAGAGAAGACATTCATGTACCACCTACTAGTCCCTCTCTCCCCAACCTCTGCCAGGGC 

AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 

AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 

CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 

CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 

CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 

ATGCCAGAGCAAGACTCAAAGAGGCAGAGGT TT TGTTCT CAAATAT TTT TT AATAAATAGACGAA 

ACCACG 



FIGURE 28 



MDILVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPYLMAVLTPKSNRKMESKKRELFSQIKGL 

TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFWAPGEDM 

RQLADGSMDVWCTLVLCSVQSPRKVLQEVRRVLRPGGVLFFWEHVAEPYGSWAFM^ 

KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL 

QLEQATHQPIYLPLRGT 

Important features : 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern. 

amino acids 10-32 

N-myristoylation sites. 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 

CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGC T GG GAGATAG GGAACAG AAGAG GGT AG TGGGTGGGC TAG GG GG GC T G C C T TAT T T AAA 
GTGGTTGTTTATGATTCTTATACTAATTTATACAAAGATATTAAGGCCCTGTTCATTAAGAAATT 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGTTAAAAGCTGAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 30 

MLLLTLLLLLLLLKGSCLE WGLVGAQKVS S ATDAP I RDWAFFP P S FLCLLPHRPAMTCS QAQPRG 
EGEKVGDG 



Important features: 
Signal peptide: 

amino acids 1-15 



I* 5, Growth factor and cytokines receptors family: 

*!? amino acids 3-18 
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FIGURE 31 

GTT T GAAT T CC T TCAACTAT AC CCACAGT CCAAAAGCAGACT CACT GTGT CCCAGGC TACCAGT T 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
ATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAATCATGTCGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTCTGCGGTGTTT TAT G GT GGC T G TAT TAT GAC TAT AC C AAC G AC C T C AG CAT AGAAT T G G ACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
y, C TGCTCGTCTTGATTT TTGTTCTCAGAAAGAGAATAAAATTGACAGT TGAGCTT T TCCAAATCAC 

0 AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
© TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
||| AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 

TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 
TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
* GATTCCGAGAATCATTGTCATGTACATGCAAAACGCACTGAAAGAACAGCAGCATGGTGCATTGT 
J:! CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 

in 

f*| AACCAGAAT GCATATACTACAAC T GC TAT T AAT GGGACAGAT T T C T GTACAT CAGCAAAAGAT GC 

f|J ATTCAAAATCTTGTCCAAGAACTCAAGTCA£TT^ 

Q TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
III AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 

CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
TGGAAACAAATGATGGATCGTCAGAAAAGCCCTACTTTATGGATCAAGAATTTCTGAGTTTCGTA 
AAAAGGAGCAACAAAT T AAACAATGCAAGGGCACAGCAGGACAAGCACT CAT TAAGGAAT GAGGA 
GGGAACAGAACTCCAGGCCATTGTGAGATAGATACCCATTTAGGTATCTGTACCTGGAAAACATT 
TCCTTCTAAGAGCCATTTACAGAATAGAAGATGAGACCACTAGAGAAAAGTTAGTGAATTTTTTT 
TTAAAAGACCTAATAAACCCTATTCTTCCTCAAAA 
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FIGURE 32 

MSGRDT I LGLC ILALALSLAMMFTFRFI TTLLVHI FI SLVI LGLLFVCGVLWWL YYDYTNDLS I E 
LDTERENMKCVLGFAIVSTGITAVLLVLIFVLRKRIKLTVELFQITNKAISSAPFLLFQPLWTFA 
ILIFFWVLWVAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFILACQQMTIA 
GAWTCYFNRSKNDPPDHPILSSLSILFFYHQGTWKGSFLISWRIPRIIVMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FIIFLGKVLWCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAHSFLSVFETVLDALFLCFA 
VDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 

Important features: 
Signal peptide: 

amino acids 1-20 

Putative transmembrane domains : 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 
N-glycosylation sites. 

amino acids 204-208, 295-299, 313-317 
N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 



App_ID=l 0063549 



Page 184 



FIGURE 33 



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 

ATAAT T T GTAT GGGAT T T GTGATGCAGGAAAGC C TAAGGGAAAAAGAATAT T CAT TC TGT GT GGT 

GAAAAT TT T T T GAAAAAAAAAT T GCC T T CT TCAAACAAGGGTGT CAT T CT GATATT TATGAGGAC 

TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 

AT TCAAACAAAGAAACGGCAAAGAAGAT TAAAAGGCCCAAGTT CACT GT GCCT CAGAT CAAC T GC 

GATGTCAAAGCCGGAAAGATCATCGATCCTGAGTTCATTGTGAAATGTCCAGCAGGATGCCAAGA 

CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 

TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 

GG T TACAAAGGGAG T TAT T CCAACGGT GT CCAATCGT TAT CCCTACCACGATGGAGAGAAT CCT T 

TATCGTCTTAGAAAGTAAACCCAAAAAGGGTGTAACCTACCCATCAGCTCTTACATACTCATCAT 

C GAAAAGT CCAGCT GCCCAAGCAGGTGAGACCACAAAAGC C TATCAGAGGCCACC TAT T CCAGGG 

ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 

CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 

GCCACAGGAGCCAGGAGAT GGAT CT CT GGT CCACTGC CACC TACACAAGCAGCCAAAACAGGCCC 

AGAG C TGAT C CAGGT AT C CAAAGGC AAGAT C C T T C AG GAG CTGCCTTC CAGAAACC T G T T GGAGC 

GGATGTCAGCCTGGGACTTGTTCCAAAAGAAGAATTGAGCACACAGTCTTTGGAGCCAGTATCCC 

T GGGAGAT CCAAACT GCAAAAT TGAC T TGTC GT T TT TAAT T GAT GGGAGCACCAGCATT GGCAAA 

CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 

CGGTCCACTGATGGGTGTTGTCCAGTATGGAGACAACCCTGCTACTCACTTTAACCTCAAGACAC 

ACACGAAT T CTCGAGATCT GAAGACAGCCATAGAGAAAAT TAC TCAGAGAGGAGGACT T TCTAAT 

GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 

GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 

GACT T GCGAGAGAGTCAGGAAT CAACAT T T T CT TCAT CACCAT TGAAGGTGCTGC TGAAAAT GAG 

AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 

GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 

ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 

GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 

AGAGTT TGAGAT TTCCGACACGGACACGCGCATCGGGGCCGT GCAGTACACCTACGAACAGCGGC 

T G GAG T T T G G G T T CGAC AAG TACAGC AGCAAGC C T GAC AT CCT CAAC G C CAT C AAGAGGGT G G GC 

TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 

GTCCAAGCCCAACAAGAGGAAGTTAATGATCCTCATCACCGACGGGAGGTCCTACGACGACGTCC 

GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 

GCCCAAGAGGAGC TAGAAGTCAT TGCCAC T CACCC CGCCAGAGACCAC TCCTTCTTTGT GGACGA 

GTTTGACAACCTCCATCAGTATGTCCCCAGGATCATCCAGAACATTTGTACAGAGTTCAACTCAC 

AGCC TCGGAACTGAAT T CAGAGCAGGCAGAGCACCAGCAAGT GC TGC T T T ACTAAC TGACGT G TT 

GGACCACCCCACCGCT TAAT GGGGCACGCAC GGT GCAT CAAG TC T TGGGCAGGGCATGGAGAAAC 

AAATGTCTTGTTATTATTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGA 

T GATCACAAACGTATAGAAT GAGCCAAAAGGC TACAT CAT GT TGAGGGT GC T GGAGAT T T TACAT 

T T TGACAAT T GT T T TCAAAATAAATGT TCGGAATACAGTGCAGCCCT TACGACAGGC TTACGTAG 

AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 

T T T TGT CAT GACAATGTAGGAAT TGC TGAATTAAATGT T TAGAAGGATGAAAAATAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 



FIGURE 34 

MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG 

CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 

ESFIVLESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQLLAVTVAVA 

TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 

VGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLLADVAQALDI 

GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 

RSGAPNWVVMVDGWPTDKVEEASRLARESGINIFFITIEGAAENEKQYVVEPN 

FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 

LTKE FE I S DT DTRI GAVQYT YEQRLE FG FDKYS SKPDI LNAIKRVGYWS GGT S TGAAINFALEQL 

FKKS KPNKRKLM I L I T DGRS YDDVR I PAMAAHLKGVI TYAI GVAWAAQEE LE VI ATHPARDHS FF 

VDE FDNLHQYVPRI I QNI C TE FNS QPRN 

Important features: 
Signal peptide: 

amino acids 1-2 6 

Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation site. 

amino acids 304-308 
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FIGURE 35 



CCGAGCACAGGAGATTGCCTGCGTTTAGGAGGTGGCTGCGTTGTGGGAAAAGCTATCAAGGAAGAAATTGC 
CAAACC AT GT CT TT T T TT CTGT T T T C AGAGTAGT TC AC AAC AGATCTG AGT GTT TT AATT AAGC AT GG AAT 
AC AG AAAACAAC AAAAAACT T AAGC T TT AATT T CATCT GGAATT CC AC AG TTTT CT TAGC TC CCT GGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGG ATGG CCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGATGTCACTGAGATCCCTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTAC 
CTCAGC CTT C CC C ACT AC AAT GTGAT AGAACGCGT G AACT GGAT GT ACTT CT AT GAGT AT GAGCCGAT T T A 
C AGACAAG AC TT TCAC TT C ACACT TCGAGAGC AT T C AAACT GCT CTCATC AAAATCCAT T TCTGGT C ATTC 
TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 
T GGT GGGGAT AT GAGGTT CTT AC ATT TT TCTT AT T AGGC C AAGAGGCT GAAAAGGAAGAC AAAAT G TT GGC 
fl% ATT G TC CTT AGAGGAT GAACACCT TCTT TATGGT GACAT AAT CCGACAAGAT TT TT TAGACACAT ATAAT A 

1=1 ACCTGACCTTGAAAACCATTATGGCATTCAGGTGGGTAACTGAGTTTTGCCCCAATGCCAAGTACGTAATG 
111 AAGACAGACACT GATG TT TTC ATC AATACT GGC AAT TT AGTGAAGT AT CT TTT AAACCT AAAC CAC TCAG A 

4* GAAGTT TTTC AC AGGT TATCCT CT AATT GAT AAT T ATT CC TAT AGAGGAT T T T ACC AAAAAACCC AT ATT T 

,f\ 

™ CTT ACC AGGAGT ATCC TT TCAAGGTGT T CC CTCC AT ACTGCAGT GGGT TGGGT T AT AT AATGT CCAGAGAT 

TTGGTGCCAAGGATCT AT GAAATG ATGGGT CACGTAAAACCC ATCAAGTT T G AAGATGTT TAT GTC GG GAT 
CTGTTTGAATTTATTAAAAGTGAACATTCATATTCCAGAAGACACAAATCTTTTCTTTCTATATAGAATCC 
AT TT GGATGT CT GT CAAC TG AG ACGT GT GATT GC AG CCCAT GGC TTTTCTT C CAAGGAGATC ATCACT TTT 



Hi 



f|| TGGCAGGTCATGCTAAGGAACACCACATGCCATTATTAACTTCACATTCTACAAAAAGCCTAGAAGGACAG 



GAT ACCT TGT GGAAAGTGT TAAATAAAGT AGGT ACT GTGGAAAAT T CATGGGGAGGTC AGTGTGCT GGCTT 
f|| AC ACTGAACT GAAACT CAT GAAAAACCC AGACT GGAGACT GG AGGGTT AC ACT T GT GATT TAT T AGTCAGG 

CCCT TC AAAG AT GAT ATGTGGAGG AAT T AAAT AT AAAGGAAT TGGAGGTT T T T GCTAAAGAAATT AAT AGG 
ACCAAACAATTTGGACATGTCATTCTGTAGACTAGAATTTCTTAAAAGGGTGTTACTGAGTTATAAGCTCA 
CT AGGC TGT AAAAACAAAAC AAT GTAGAGT TTT ATT T ATT GAAC AATG TAGT CACT TGAAGGTTTT GT GT A 
TAT C TT ATGT GGAT T ACC AATTTAAAAAT AT AT GT AGTT CTGTGTC AAAAAACTT C TT CACT GAAGTT AT A 
CTGAACAAAATTTTACCTGTTTTTGGTCATTTATAAAGTACTTCAAGATGTTGCAGTATTTCACAGTTATT 
AT T ATTTAAAAT TACT TC AACT TT GT GT TT T T AAATGTT T TGACGAT T TCAAT ACAAGAT AAAAAGGATAG 
TG AATC ATT C TT T AC ATGCAAACAT TTT CC AGTT ACTT AACT GATC AGTT T ATT AT T GAT AC AT CACT CCA 
TTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAATATTTTACTGTGGT 
AAT AT AGAGAAGAATT AAAGCAAGAAAATC T GAAAA 
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FIGURE 26 



MASALWTVLPSRMSLRSLKWSLLLLSL^ 

TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA 
LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KPIKFEDVYVGICLNLLKVNIHIPEDTNLFFLYRIHLDVCQLRRVIAAHGFSSKEIITFWQVMLR 
NTTCHY 

Important features: 

Type II transmembrane domain: 

amino acids 20-39 

N-glycosylation sites. 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 

Glycosaminoglycan attachment site. 

amino acids 239-243 

Ly-6 / u-PAR domain proteins. 

amino acids 23-37 

N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 



CGCTCGGGCACCAGCCGCGGCAAGGATGGAGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCCTTCTCATC 
TCGTCCT T GCCAAGAGAGTAC ACAGTCAT TAATGAAGC CT GCCC TGGAGCAGAGTGGAATATCATGTGTCGGGAG TGC TGTG 
AAT ATGATC AGAT TGAGTGC GT C T GCCCCGGAAAGAGGGAAGTCGTGGGTTATAC CATCCC T T GC TGCAGGAATGAGGAGAA 
TGAGTGTGACTCCTGCCTGATCCACCCAGGTTGTACCATCTTTGAAAACTGCAAGAGCTGCCGAAATGGCTCATGGGGGGGT 
ACCTTGGATGACTTCTATGTGAAGGGGTTCTACTGTGCAGAGTGCCGAGCAGGCTGGTACGGAGGAGACTGCATGCGATGTG 
GCC AGGT TC TGC GAG CCCCAAAGGGTCAGAT TT TGT TGGAAAGC TATCC CCTAAATGCT CAC T GT GAATGGAC CATT CATGC 
TAAACCTGGGTT TGT CATCC AAC TAAGAT TT GT CATGT TGAGTC TGGAGT T TGACT ACATGTGC C AGTATGAC TATGT TGAG 
GTTCGTGATGGAGACAACCGC GATGGCCAGATC AT CAAGCGTGTC TGTGGCAACGAGCGGC CAGCTCC TAT C CAGAGCATAG 
GATCCTCACTCCACGTCCTCTTCCACTCCGATGGCTCCAAGAATTTTGACGGTTTCCATGCCATTTATGAGGAGATCACAGC 
ATGCTCC TCATCCCC T T GT T T CC ATGACGGC AC GT GC GTC C TTGACAAGGC T GGAT CT TACAAGTGTGCCTGC TTGGCAGGC 
Q TAT AC TGGGCAGCGC TGTGAAAATCTCC T TGAAGAAAGAAACTGC TC AGACCC T GGGGGCC CAGTCAAT GGGTAC CAGAAAA 

P\ TAACAGGGGGCC CTGGGC T TATCAACGGACGC CATGC TAAAAT TGGC ACCGTGGT GT CT TTC T T TTGTAACAACTCC TATGT 

fll T CT T AGT GGCIAATGAGAAAAGAAC TTGCCAGCAGAATGGAGAGTGGTC AGGGAAACAGCCCATC TGCATAAAAGCC TGCCGA 

Is § GAACCAAAGAT T TCAGACC TGGTGAGAAGGAGAGT TCTTC C GATGCAGG TTC AGT CAAGGGAGAC ACC ATTACACC AGC TAT 

ACTCAGCGGCCTTCAGCAAGCAGAAACTGCAGAGTGCCCCTACCAAGAAGCCAGCCCTTCCCTTTGGAGATCTGCCCATGGG 
ATACC AAC ATCT GCATACCC AGCTCCAGTATGAGT GC ATC TC ACC C T TC TAC C GCCGCC TGGGCAGCAGCAGGAGGACATGT 
CTGAGGAC TGGGAAG TGGAGTGGGCGGGCACCATC C TGC ATCC CTATC TGCGGGAAAAT TGAGAACATC AC T GCT C CAAAGA 
CCC AAGGGTTGCGCT GGCCGTGGC AGGC AGC CATC T ACAGGAGGACCAGCGGGGT GC ATGACGGCAGCC TACACAAGGGAGC 
GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 
GTCAC CATGAT CAAGACAGC AGACC TGAAAGT TGT T TTGGGGAAATT C TAC C GGGAT GATGAC CGGGAT GAGAAGAC CAT CC 
AGAGC CTACAGATTT CT GC TATCAT TC TGCATC CCAACTATGACCCC ATCCT GCT TGATGC TGACATC GCCAT CC TGAAGC T 
C CT AGACAAGGC CCGT AT CAGCACCCGAGTC CAGC C CATC TGCCTCGC TGC CAGTC GGGATC TCAGCAC T T CC TTCC AGGAG 
TCCCACATCACTGTGGCTGGCTGGAATGTCCTGGCAGACGTGAGGAGCCCTGGCTTCAAGAACGACACACTGCGCTCTGGGG 
TGGTCAGTGTGGTGGAC TC GC TGC TGTGTGAGGAGCAGCATGAGGACC ATGGC ATCCCAGT GAGTGTC ACT GATAACATGTT 
CTGTGCCAGCTGGGAACCCACTGCCCCTTCTGATATCTGCACTGCAGAGACAGGAGGCATCGCGGCTGTGTCCTTCCCGGGA 
CGAGCATCTCCTGAGCCACGCTGGCATCTGATGGGACTGGTCAGCTGGAGCTATGATAAAACATGCAGCCACAGGCTCTCCA 
C TGCC TTC ACCAAGGTGCTGC CTT T TAAAGAC TGGATTGAAAGAAATATGAAATGAACCATGC TC ATGCAC TCCT TGAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GTGCC AGGGCT T C TGAC T TCAGGGAC^AAAAC TC AGTGAAGGGTGAGTAGACCT CC AT TGC T GGTAGGC TGATGCCGC GTCC A 
CTAC T AGGACAGCCAAT TGGAAGATGCCAGGGC TTGCAAGAAGTAAGT TTC T TCAAAGAAGACCATATACAAAAC C TCTC CA 
CTCCACTGACC T GGTGGTC T TCCCC AACT TT CAGT TATAC GAATGCCATCAGC TTGACCAGGGAAGATC TGGGCT TC ATGAG 
GCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGACAGCCCAGGGCAGCAGAGCTGGGATGTGGTGCATGCCTT 
TGTGTACATGGC CACAGTAC AGTC TGGTC CT T TTCCTTCCC CATC TCT TGTAC AC AT TT TAATAAAATAAGGG TT GGC T TC T 
GAACTACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 38 



MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
I PCCRNEENECDSCL I HPGCT I FENCKSCRNGS WGGT LDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQILLESYPLNAHCEWTIHAKPGFVIQLRFVMLSLEFDYMCQYDYVEVRDGDNRDGQII 
KRVCGNERPAPIQSIGSSLHVLFHSDGSKNFDGFHAIYEEITACSSSPCFHDGTCVLDKAGSYKC 
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTWSFFCNNSYVLSGNE 
KRTCQQNGEWS GKQP I C IKACRE PK I S DLVRRRVLPMQVQSRE TPLHQLYS AAFSKQKLQS APTK 
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP 
KTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALWERTVWAAHCVTDLGKVTMIKTADL 
KWLGKFYRDDDRDEKT I QSLQI SAI I LHPNYDP I LLDADI AILKLLDKARI S TRVQPICLAASR 
DLSTSFQESHITVAGWNVLADVRSPGFKNDTLRSGWSWDSLLCEEQHEDHGIPVSVTDNMFCA 
SWEPTAPSDICTAETGGIAAVS FPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 
ERNMK 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

EGF-like domain cysteine pattern signature. 

amino acids 260-272 
N-glycosylation sites. 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 
474-480, 491-497, 638-644, 666-672 
Amidation site, 
amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 3Q 



? 5s 
Q 



in 



GGTT C CT ACAT C CT CT CATCT GAGAAT CAGAGAGCATAAT CTT CTTAC GGGCC CGT GATTTAT TAAC GTGGCT TAAT C 
TGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 
TTT GGGC C CTT GT AGCT GACAGAAGGT GGCCAGGGAGAAT G CAGCACACT GCT CGGAGAMIGAAG GCGCTT CT GT TGC 
TGGTCTTGCCTTGGCTCAGTCCTGCTAACTACATTGACAATGTGGGCAACCTGCACTTCCTGTATTCAGAACTCTGTA 
AAGGTGCCTCCCACTACGGCCTGACCAAAGATAGGAAGAGGCGCTCACAAGATGGCTGTCCAGACGGCTGTGCGAGCC 
T CACAGCCACGGCTC CCT CC CCAGAGGTTT CT GCAGCT GCCAC CAT CT CCTTAAT GACAGACGAGCCT GGCCTAGACA 
ACCCTGCCTACGTGTCCTCGGCAGAGGACGGGCAGCCAGCAATCAGCCCAGTGGACTCTGGCCGGAGCAACCGAACTA 
GGGCACG GCC CTTT GAGAGAT CCACT AT TAGAAGCAGAT CATT TAAAAAAATAAAT CGAGCTTT GAGT GTT CT TCGAA 
GGACAAAGAGCGG GAGTGCAGTT GCCAACCATGC CGACCAG GGCAGGGAAAATT CT GAAAACAC CACT GC CCCT GAAG 
T CTT T CCAAGGTT GTACCAC CTGATT CCAGATGGT GAAATTAC CAGCAT CAAGAT CAAT C GAGTAGAT C C CAGT GAAA 
GCCT CT CT ATTAGGCT GGT GGGAGGTAGC GAAACC CCACT GGTCCATAT CATT AT CCAACACAT TTAT CGT GAT GGGG 
TGAT CGCCAGAGACGGCC GGCTACT GCCAGGAGACATCAT T CTAAAGGT CAAC G GGAT GGACAT C AGCAAT GT CCCTC 
ACAACT ACGCT GT GCGTC T C CTGC GGCAGCCCT GCCAG GT GCT GT G GCT GACT GTGAT GCGTGAACAGAAGTT CCGCA 
GCAGGAACAAT GGACAGGCC CCGGAT GCCTACAGACCCCGAGAT GACAG CTTT CAT GT GAT TCT CAACAAAAGTAGCC 
CCGAG GAGCAGCTT GGAATAAAACT GGT GCGCAAGGTGGAT GAGC CT GGGGTT TT CATCTTCAAT GT GCT GGAT GGCG 
GTGTGGCATATCGACATGGTCAGCTTGAGGAGAATGACCGTGTGTTAGCCATCIAATGGACATGATCTTCGATATGGCA 
GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC 
GGAGCCCT GACAT CTT T CAGGAAGCCGGCTGGAACAGCAATGGCAGCT GGT CCC CAGGGCCAGGGGAGAGGAG CAACA 
CTCCCAAGCCCCTCCATCCTACAATTACTTGTCATGAGAAGGTGGTAAATATCCAAAAAGACCCCGGTGAATCTCTCG 
GCAT GACC GTCGCAGGGGGAGCAT CACATAGAGAAT GGGATTT GCCTAT CTAT GTCAT CAGT GTTGAGC CCGGAGGAG 
T CATAAGC AGAGAT GGAAGAATAAAAACAGGT GACATT T T GTT GAAT GT GGAT GGGGT CGAACT GAC AGAGGT CAGCC 
GGAGTGAGGCAGT GGCATTAT T GAAAAGAACAT CAT C CT CGAT AGT ACT CAAAGCT TT G GAAGT CAAAGAGTAT GAGC 
C CCAGGAAGACT GCAGCAG CCCAGCAGCCCT GGACT CCAAC CACAACAT GGC C CCACC CAGT GACT GGT CCCCATCCT 
GGGT CAT GT GGCT GGAATTACCACGGT GCTT GTATAACT GT AAAGATAT TGTAT TACGAAGAAACACAGCT G GAAGT C 
TGGGCTT CT GCATTGTAGGAGGT TAT GAAGAATACAAT G GAAACAAACCTTT TTTCAT CAAAT CCAT T GTT GAAGGAA 
CAC CAGCATACAAT GAT GGAAGAATTAGATGT GGTGATATT CTT CTT GCTGT CAAT G GTAGAAGTACAT CAGGAATGA 
TACATGCTTGCTTGGCAAGACTGCTGAAAGAACTTAAAGGAAGAATTACTCTAACTATTGTTTCTTGGCCTGGCACTT 
TT TT ATAGA AT CAAT GAT GG GT CAGAGGAAAACAGAAAAAT CACAAATAGGCTAAGAAGTT GAAACACTATATTTAT C 
TT GT CAGTTTT TATATT TAAAGAAAGAATACAT T GT AAAAAT GT CAGGAAAAGTAT GAT CAT CTAAT GAAAGC CAGTT 
ACAC CT CAGAAAATAT GAT TCCAAAAAAATTAAAACTACTAGTTTT T TT TCAGT GT GGAGGATT T CT CATT ACT CT AC 
AACATT GTT TATAT TT TTT CTATT CAATAAAAAGCC CTAAAACAACT AAAAT GATT GAT TT GTATACCCCACT GAATT 
CAAGCT GAT TT AAATT T AAAATT T GGTATAT GCT GAAGT CT GCCAAGGGTACAT TAT GGC CATTTTT AATTTACAGCT 
AAAATATTTTTTAAAATGCATTGCTGAGAAACGTTGCTTTCATCAAACAAGAATAAATATTTTTCAGAAGTTAAA 
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FIGURE do 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAAT I SLMTDEPGLDNPAYVS SAEDGQPAI S PVDSGRSNRTRARPFERS TIRSRS FKKINR 
ALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDPSESLSIRLV 
GGSETPLVHIIIQHIYRDGVIARDGRLLPGDIILKVNGMDISNVPHNYAVRLLRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDSFHVILNKSSPEEQLGIKLVRKVDEPGVFIFNVLDGGVAYRHG 
QLEENDRVIAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPDIFQEAGWNSNGSWSPG 
PGERSNTPKPLHPTITCHEKVVNIQKDPGESLGMTVAGGASHREWDLPIYVI SVEPGGVISRDGR 
IKTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEWEYEPQEDCSSPAALDSNHNMAPP 
S DWS PS WVMWLELPRCLYNCKD I VLRRNT AGSLGFC I VGGYEE YNGNKP FFI KS I VEGT PAYNDG 
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE 41 



ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAATCAGATTGGAAAAGCTCAACTTGAAGCTTT 
CTTGCCTGCAGTGAAGCAGAGAGATAGATATTATTCACGTAATAAAAAACATGGGCTTCAACCTGACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAGCAAAGGAGTTCATGGCTAATTTCC 
AT AAGACC C T CAT T T T GGGGAAG G GAAAAAC T C T GACTAAT GAAG CAT C CAC GAAGAAG GT AGAAC TT 
GACAACTGTCCTTCTGT GT CT CCT TACCT CAGAGGCCAGAGCAAGCTCATTT T CAAACCAGATCT CAC 
TTTGGAAGAGGTACAGGCAGAAAATCCCAAAGTGTCCAGAGGCCGGTATCGCCCTCAGGAATGTAAAG 
Kl CTTTACAGAGGGTCGCCATCCTCGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 
f*| CAT C T GC ATCC C T T C C T GCAGAG GC AGC AGC T GGAT TAT G GCAT CTAC G T CAT C CAC CAGGCT GAAGG 

O TAAAAAGTTTAATCGAGCCAAACTCTTGAATGTGGGCTATCTAGAAGCCCTCAAGGAAGAAAATTGGG 
I? 3 AC T G C T T TAT AT T CCAC GAT GTG GAC C T G GT AC C CGAGAAT GAC T T T AACC T T TAC AAGT GT GAGGAG 



PI 



CATCCCAAGCAT CTGGTGGT T GGCAGGAACAGCACT GGGTACAGGTT ACGT TACAGT GGATATTT TGG 
GGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAACAACTACTGGGGAT 
GGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG 
CCTGAA.GTGGGTAAATATACAATGGTCTTCCACACTAGAGACAAAGGCAATGAGGTGAACGCAGAACG 
GATGAAGCTCT TACACCAAGT GTCACGAGTC T GGAGAACAGAT GGGT TGAGTAGTT GTT CTTATAAAT 
TAGTATCTGTGGAACACAAT CCTTT ATATATCAACAT CACAGTGGATTT CTGGTTT GGT GCATGACCC 
TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 
AT AG TAG CAC AC AT T AAGAAC CT GT TAC AGC T CAT T G T T GAG C T GAAT TTTTCCTTTTTG TAT T T T CT 
TAGCAGAGCTCCT GGTGAT GTAGAGTATAAAACAGT TGTAACAAGACAGCTTT CT TAGTCATT TT GAT 
CAT GAGGGT TAAATAT T GT AATAT G GAT ACT T GAAG GAC T T TAT ATAAAAG GAT GAC TCAAAGGATAA 
AATGAACGCTATTTGAGGACTCT GGTTGAAGGAGAT TTAT T TAAATTTGAAGTAATATAT TAT GGGAT 
AAAAG GC CACAG GAAATAAGACT GC T GAAT GTCT GAGAGAACC AGAGTT GT T C T C GT CCAAGGT AGAA 
AGGTACGAAGATACAATACT GTTAT TCATTTAT CCT GTACAATCATCTGTGAAGTGGTGGT GTCAGGT 
GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGC GAC GAAT CAGGACACAGTGAACTT GGGAAT GAAGA 
G GTAG CAG GAGG GT G GAGT G T CGGC T GCAAAGGCAG CAGT AGC T GAGCT GGT T GC AG GT GC T GAT AGC 
CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 
TTTTAAAGAGT TTTT GTAAAATGAT TTT GTACAAGT AGGATAT GAATTAGCAGTT TACAAGTT TACAT 
AT T AAC T AAT AAT AAAT AT G T C TAT C AAAT AC C T C T G TAG T AAAAT G T G AAAAAG CAAAA 
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FIGURE 42 

MGFNLTFHLSYKFRLLLLLTLCLTWGWATSNYFVGAIQEIPKAKEFMANFHKTLILGKGKTLTN 
EASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH 
RNREKHLMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 
SVEHNPLYINITVDFWFGA 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins. 

amino acids 191-202 
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FIGURE 43 



GCTCAAGACCCAGCAGT GGGACAGCCAGACAGACGGCAC GATG GCAC T GAGCTCCCAGAT C T GGG 

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAACT T GCAGAGC TGCAACCCCAGGACAGAGCT GGAGCCAGGGCCAGC TGGATGCCCAT 

GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAZ^^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 44 

MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTHFPI 
C I FCCGCCHRSKCGMCCKT 

Important features: 
Signal peptide: 

amino acids 1-24 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 58-59 

CP 

III N-myristoylation site. 

Ui amino acids 44-50 

Prokaryotic membrane lipoprotein lipid attachment site. 

ri amino acids 1-12 

in 



fii 
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FIGURE 45 

GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAATATGGCTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCASAGTAAAGCAAGTTGACTCTATTGTCTG 
GACC T TCAACACAACCCCT CT TGTCACCATACAGCCAGAAGGGGGCAC TAT CATAGT GACCCAAA 
ATCGTAATAGGGAGAGAGTAGACTTCCCAGATGGAGGCTACTCCCTGAAGCTCAGCAAACTGAAG 
AAGAATGAC T CAGGGAT CTAC TATGT GGGGATATACAGC T CAT CACT CCAGCAGCCC TCCACCCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
ATAAGAAT GGCACC TGTGT GACCAATC TGACAT GCTGCATGGAACAT GGGGAAGAGGATGT GAT T 
TATACCTGGAAGGCCCTGGGGCAAGCAGCC7\ATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
CTGGAGATGGGGAGAAAGTGATATGACCTTCATCTGCGTTGCCAGGAACCCTGTCAGCAGAAACT 
TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
GTT TC TGAAGAGAGAGAGACAAGAAGAGTACAT T GAAGAGAAGAAGAGAGTGGACAT TTGTCGGG 
AAAC T CCTAACATATGCCCC CAT TC TGGAGAGAACACAGAGTACGACACAAT CCCTCACAC TAAT 
AGAACAATCCTAAAGGAAGATCCAGCAAATACGGTTTACTCCACTGTGGAAATACCGAAAAAGAT 
GGAAAAT CCCCACT CACT GC TCACGAT GCCAGACACACCAAGGC TAT TTGCCTATGAGAATGTTA 
TCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCA 
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FIGURE A6 

MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVTIQP 
EGGTIIVTQNRNRERYDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRWGESDMTFIC 
VARNPVSRNFS S P I LARKLCEGAADDPDS SMVLLCLLLVPLLLSLFVLGL FLWFLKRERQEE Y I E 
EKKRVDICRETPNICPHSGENTEYDTIPHTNRTILKEDPANTVYSTVEIPKKMENPHSLLTMPDT 
PRLFAYENVI 



jsA Important features: 

y Signal peptide: 

^ amino acids 1-22 

lis 

||! Transmembrane domain : 

*B amino acids 224-250 

Leucine zipper pattern. 

|p amino acids 229-251 

**% 

fU N-glycosylation sites. 

CI amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 

291-295 
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FIGURE 47 

GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGTCTGATGGCCATTCCAGCAACAACAATGTCCTTGACAGCAAGAAAAAGAGCGTG 
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGTAATGCCAATTGTGAAT TT TCATTGAAAAACATCAGTGACAT TCAT CCAGAATCCTTCAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
ACACCATGGCGAGTGGCT GGAGAGCATCTAGT TTCCACTTCGAT TCTGAAGAAAACAAACATAGG 
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
T G TAG TT TAAT GGGAATAAAATGTAAGTAT CAGTAGT T T GAAAAAAAAAAA 
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FIGURE 48 

MTCCEGWTSCNGFSLLVLLLLGWLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMAIPA 
TTMSLTARKRACCMRTGMFLSSFFSVITVIGALYCMLISIQALLKGPLMCNSPSNSNANCEFSL 
KNI SDIHPES FNLQWFFNDSCAPPTGFNKPTSNDTMASGWRAS S FHFDSEENKHRL IHFSVFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features : 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 
N-glycosylation sites. 

amino acids 80-84, 132-136, 148-152, 163-167 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE 4Q 



ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGTGAGGAACCTGCAGAGTCACACAGT TGCTGACCAAT TGAGCTGTGAGCCTGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CATTTGCCAATAAAGACGATCCCTTCTACTATGACTGGAAAAACCTGCAGCTGAGCGGACTGATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCACTCATCACTCCAGGCTCTGCCA 
h h CTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
|;;f TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
JjJ TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 

III 

%u 

m 

|? : § s 
g ^? 

n 

m 
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FIGURE *o 

MERVTLALLLIiAGLTALEANDP FANKDDP FY YDWKNLQLS GL I CGGLLAI AG I AAVLSGKCKYKS 
SQKQHSPVPEKAIPLITPGSATTC 



Important features: 
Signal peptide: 

amino acids 1-16 

H Transmembrane domain: 

_j5* amino acids 36-59 

1*1 

Ill- 
ill N-myristoylation sites. 



■if? 



ill 



amino acids 41-47, 45-51, 84-90 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 
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FIGURE si 



G T GGAC T C T GAGAAGC C CAG GC AG T T GAG GAC AG GAGAGAGAAGGC T GC AG ACC CAGAG G GAG GG 
AGGACAGGGAGT C GGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 
AGGAGACCCTGGTGGGAGGAAGACACTCTGGAGAGAGAGGGGGCTGGGCAGAGATGAAGTTCCAG 
GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
CGGAGAGGAAAGCAC T GGGACAAATAT TGGGGAGGCCCT TGGACATGGCCT GGGAGACGCCC TGA 
GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 
AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 
TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG 
GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 
CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 
GAAACTCAGCAGGCAGCT TTGGAATGAATCCTCAGGGAGCTCCC TGGGGT CAAGGAGGCAATGGA 
GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 
AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 
ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 
AATGGCAGCAGCAGT GGTGGCAGCAGCAGT GGCAGCAGCAGT GGCAGCAGCAGT GGCGGCAGCAG 
TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 
CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 
CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 
CTTCAGAGGACAGGGAGTTTCCAGCAACATGAGGGAAATAAGCAAAGAGGGCAATCGCCTCCTTG 
GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 
GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 
GAAT T T T AAAT CCAAGC TGGGT T T CATCAAC TGGGATGC C AT AAAC AAGGACCAGAGAAGC T C TC 
GCATCCCGTGACCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA 

AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 
^^M^AAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE *2 

MKFQGPLACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAI GKEAGGAAGSKVS 
EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHEIGRQAEDVIRHGADAVRGSWQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSNMREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGWTVNSETSPGM 
FNFDTFWKNFKSKLGFINWDAINKDQRSSRIP 



Signal peptide: 

amino acids 1-21 



N-glycosylation site. 

amino acids 265-269 

Glycosaminoglycan attachment site. 

amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 

Casein kinase II phosphorylation site. 

amino acids 26-30, 109-113, 259-263, 300-304, 304-308 



N-myris toylation si te . 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE 53 



GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATOTCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 
%4> AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
13 GAAGT CCTATATAACGAT CT TCAACAAGAGT GCAAACAT CATGCT TGACAAGTGGCAGCACCT GG 

W CCTCAGAGGGCAGCAGTCGTCTGGACATGTT TGAGCACATCAGCC TCATGACCT TGGACAGTCTA 

Tl CAGAAAT GCATCT TCAGCT T TGACAGCCATT GTCAGGAGAGGCCCAGT GAAT ATAT TGCCACCAT 

H C TTGGAGCT CAG TGCCC T T GTAGAGAAAAGAAGCCAGCATATCCT CCAGCACAT GGAC T T T C T GT 

„g ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 
*0 GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 
* w AGCCAAGT C CAAGAC T T T GGAT T TCAT TGAT GT GC T T CT GCTGAGCAAGGAT GAAGATGGGAAGG 

CAT T GT CAGAT GAGGATATAAGAGCAGAGGC TGACACCT T CAT GT TT GGAGGCCAT GACACCACG 

IP I 

f*| GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 
HI ACAGGAGGTGCAAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCC 
13 AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 
5 ^ TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 

CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 
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FIGURE 

MSLLSLPWLGLRPVT^SPWLLLLLWGSWLI^IIJIWTYAFYNNCRRLQCFPQPPKRNWFWGHLG 
LITPTEEGLKDSTQMSATYSQGFTVWLGPIIPFIVLCHPDTIRSITNASAAIAPKDNLFIRFLKP 
WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 
SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 
MFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 
s i LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVWPDPEVYDPFRFDPENSKGRSP 
in LAF I P FS AG PRNC I GQAFAMAEMKWLALMLLH FR FL PDHTE PRRKLE L IMRAE GGLWLRVE PLN 

vglq 

III Important features: 

%ll Transmembrane domains: 

~R amino acids 13-32 (type II), 77-102 

P Cytochrome P450 cysteine heme-iron ligand signature. 

Ill amino acids 461-471 

J«l N -glycosylate on sites. 

f|| amino acids 112-116, 168-172 
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FIGURE 5 5 

ATCGCATCAATTGGGAGTACCATCTTCCTCATGGGACCAGTGAAACAGCTGAAGCGAATGTTTGA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAA. T TCATGGC CAGT TT TATGAAGC T T T GGAAGGCAC TAT GGACAGAAGCT GGT GGACAGT TT T 

GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
GTGATTCGAACATTTGAGGGTTACTTTTGGAAGCAACAATACATTCTCGAACCTGAATGTCAGTA 
GCACAGGAT GAGAAG T GGGT TC TGTATCT T GTGGAGT GGAATC T T CC T CAT GTACCT GT T TCCTC 
TC TGGATGT TGTCCCACTGAAT TCCCATGAATACAAACCTATTCAGCAACAGCAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



App_ID=10063549 



Page 207 



FIGURE *6 

MGPVKQLKRMFE PTRL I AT IMVLLCFALTLCS AFWWHNKGLAL I FC I LQS LALT WYSLS FI PFAR 
DAVKKC FAVC LA 

Important features: 
Signal peptide: 

amino acids 1-33 

Type II fibronectin collagen -binding domain protein. 

amino acids 30-72 
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FIGURE 57 



CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAACATGCTCTGTCTGTGCCTG 
TACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTGAGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCC 
ATTT T CAAGCT CAGT GTCT T CAT C CCCT C CCAGGAATT CT C CAC CT AC CGC CAGT G GAAGCAGAAAATT GTACAAGCT 
GGAGATAAGGACCTT GAT GG GCAGCT AGACTTT GAAGAAT TTGT CC ATTAT CT C CAAGAT CAT GAGAAGAAGCT GAGG 
CTGGT GTTTAAGAT TTTGGACAAAAAGAAT GAT GGACGCAT T GACGCGCAGGAGAT CAT GCAGT C CCT GCGGGACTTG 
GGAGT CAAGATAT CT GAACAGCAGGCAGAAAAAATT CT CAAGAGCATGGATAAAAACGGCACGAT GACCAT CGACT GG 
AAC GAGT GGAGAGACT AC CACCT C CT CCACC CCGT GGAAAACAT C CCCGAGAT CAT CCT CTACT GGAAGCATT C CACG 
ATCTT T GAT GT GGGT GAGAAT CTAACGGT CC CGGAT GAGTT CACAGTGGAGGAGAG GCAGACGGGGAT GT GGT GGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
CAGGT CCAT GCCT CCCGCAGCAACAACAT GGGCAT CGTTGGTGGCTT CACT CAGAT GATT CGAGAAGGAGGGG C CAGG 
TCACTCTGGCGGGGCAATGGCATCAACGTCCTCAAAATTGCCCCCGAATCAGCCATCAAATTCATGGCCTATGAGCAG 
ATCAAGCGCCTTGTTGGTAGTGACCAGGAGACTCTGAGGATTCACGAGAGGCTTGTGGCAGGGTCCTTGGCAGGGGCC 
ATCGC C CAGAGCAGCATCTACC CAAT GGAGGTCCT GAAGAC CCG GAT G GC GCT GCGGAAGACAGGCCAGTACT CAGGA 
|3 ATGCTGGACTGCGCCAGGAGGATCCTGGCCAGAGAGGGGGTGGCCGCCTTCTACAAAGGCTATGTCCCCAACATGCTG 
F* * GGCAT CAT CC CCTAT GC C GG CAT C GACCTT GCAGT C TAC GAGAC GCT CAAGAATGC CT GGCTG CAGCACTAT GCAGTG 

AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
|H CCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCTCCGGAGGTGACCATGAGCAGCCTC 
| a § TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 

G CT GT GAG CAT CAGCT ACGT GGT CTACGAGAACCT GAAGAT CAC CCTGGGC GT GCAGT CGCGGTGACGGGGGGAGGGC 
|I1 CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 
'g« AAGCT GTCT C GAGCCAAGCT GTGAAAACC CTAGAC GCACC CGCAGGGAGGGT GGGGAGAGCTG GCAGG C CCAG GGCTT 

*£J GTCCT GCT GAC CC CAGCAGACCCTCCT GTT G GT T C CAGCGAAGAC CACAGGCATT C CTTAGGGT CCAG GGT CAGCAGG 

V3 CTC C GGGCTCACATGT GT AAGGACAGGACATTT T CT GCAGT GC CT GCCAAT AGTGAGCT T GGAG C CTGGAGGCCGGCT 

TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
r ^ CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
O ATAATCCATGATGAAAGGTGAGGTCACGTGGCCTCCCAGGCCTGACTTCCCAACCTACAGCATTGACGCCAACTTGGC 
Iff TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 
"Z: CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
I J G GCCT GGACC CT GT CAGGAT GGGC CCC ACCT CAGAACCAAACTCACTGT CC CCACT GT GGCAT GAGGG CAGTG GAGCA 

ffp CCATGTTTGAGGGCGAAGGGCAGAGCGTTTGTGTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGG 
**J ACT GTT GG GAAAAGGGTT TTGT CCAGAAGGACAAGC CGGACAAAT GAGCGACTT CT GTGCT T CCAGAG GAAGACGAGG 

O GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 
fl I GGGAC GAG CCCC ACAT TCCACTTGT GTCACT GCTT G GAACCTATT TAT T TT GTATTTATTTGAACAGAGT TAT GTCCT 

S <;!5? AACTAT TTTTATAGAT TT GTTTAATTAATAGCT T GT CATTTTCAAGTTCAT TTT TTAT T CATAT T TAT GTTCAT GGTT 

GATTGTACCTTCCCAAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACATCT 
GTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
GTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGACTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAAT 
AACCTTGAAGGTGGAATCC^GTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 
T GAGGAAATCT T C AATAGGAT GCAAAGAT CAAT GCAAAAATTGTT ATATAT GAACATATAACT GGAGT C GT C AAAAAG 
CAAATTAAGAAAGAATTGGACGTTAGAAGTTGTC^TTTAAAGCAGCCTTCTAATAAAGTTGTTTCAAAGCTGAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE *8 

MLCLCLYVPVI GEAQTE FQY FE S KGL PAE LKS I FKL S VFI PSQE FS T YRQWKQK I VQAGDKDLDG 
QLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQEIMQSLRDLGVKISEQQAEKILKSMDKNG 
TMTIDWNEWRDYHLLHPVENIPEIILYWKHSTIFDVGENLTVPDEFTVEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVmQVHASRSNNMGIVGGFTQMIREGGARSLWRGNGINVLKIAPESAIK 
FMAYEQI KRLVGS DQET LR I HERLVAGS LAGAI AQS S I YPMEVLKTRMALRKTGQYS GMLDCARR 
I LAREGVAAFYKGYVPNMLGI I PYAGIDIAVYETLKKAWLQHYAVNSADPGVFVLLACGTMSSTC 
GQLASYPLALVRTRMQAQASIEGAPEVTMSSLFKHILRTEGAFGLYRGLAPNFMKVIPAVSISW 
VYENLKITLGVQSR 

Important features: 
Signal peptide: 

amino acids 1-16 

Putative transmembrane domains : 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE 

GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGC 
TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 
CACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCT 
GGGAACATTGGGGAGGATGGAATCCTGAGCTGCACTTTTGAACCTGACATCAAACTTTCTGATAT 
CGTGATACAATGGCTGAAGGAAGGTGTTTTAGGCTTGGTCCATGAGTTCAAAGAAGGCAAAGATG 
AGCTGTCGGAGCAGGATGAAATGT TCAGAGGCCGGACAGCAGTGT TTGCT GATCAAGTGATAGTT 
GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
tt CATCACTTCTAAAGGCAAGGGGAATGCTAACCTTGAGTATAAAACTGGAGCCTTCAGCATGCCGG 
Q AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 

^ s CAGC C CACAG T GG T C T GG GCAT C C CAAGT T GAC CAG GGAG C C AAC T T C T C GGAAG T C T C C AAT AC 

III 

m CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 

It* TCAACAACACAT ACT CC TG TATGAT TGAAAATGACAT TGC CAAAGCAACAGGGGATAT CAAAGTG 

*|| ACAGAATCGGAGATCAAAAGGCGGAGTCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGT 

* CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAAT 

?2 GT GCC TT GGCCACAAAAAAGCATGCAAAGT CAT TGT TACAACAGGGAT CTACAGAAC TATT T CAC 

S it 

1^ CACCAGAT AT GACC T AGT T T TAT AT T T CT GGGAGGAAAT GAAT T CATAT C T AGAAGT CT GG AG TG 

p J AGCAAACAAGAGCAAGA7^CAAAAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAAT 
Q- CTAT CT TCAAAGACATAT TAGAAGTTGGGAAAATAATT CAT GTGAAC TAGACAAGT GT GT TAAGA 

W GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 

CACCTGGGGAGTGAGAGGACAGGATAGTGCATGT TCTT TGTCTCTGAATTTT TAGT TATATGT GC 
TGTAATGT TGCTC TGAGGAAGCCC CT GGAAAGTC TATC CCAACATAT C CACATC T TATATT CCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
TC TTCCCAACTGACAAATGCCAAAGTTGAGAAAAATGATCATAAT TTTAGCATAAACAGAGCAGT 
CGGGGACACCGAT TT TATAAATAAACTGAGCACCT TCTT TT TAAACAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 60 

MASLGQILFWS I IS I I I ILAGAIALI IGFGISGRHS ITVTTVASAGNIGEDGILSCTFEPDIKLS 
DIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC 
YIITSKGKGNANLEYKTGAFSMPEVWDYNASSETLRCEAPRWFPQPTVVWASQVDQGANFSEVS 
NTS FELNSENVTMKWSVL YNVT INNT YS CMI ENDI AKATGD IKVTE SE I KRRS HLQLLNSKASL 
CVS S FFAI SWALLPL S PYLMLK 

Important features : 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 258-281 

N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 

N-myristoylation sites. 

amino acids 52-58, 126-132, 188-194 
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FIGURE 61 



TGACGTCAGAATCACCATGGCCAGCTATCCTTACCGGCAGGGCTGCCCAGGAGCTGCAGGACAAG 

CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 

GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 

TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 

GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 

GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 

GGTGGACT CAGATCACAGT GGC TAT AT CT CCATGAAGGAGC TAAAGCAGGCCC TGGTCAACT GCA 

ATTGGTCTTCATTCAATGATGAGACCTGCCTCATGATGATAAACATGTTTGACAAGACCAAGTCA 

GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 

CCAGCAGTATGACCGGGACC GC T CGGGC T CCAT TAGC TACACAGAGC T GCAGCAAGCTCTGT CCC 

Vri AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 

«jS GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 

S3 GGCCTTCCGGGAGAAGGACACAGCTGTACAAGGCAACATCCGGCTCAGCTTCGAGGACTTCGTCA 

* CCAT GACAGCT TC TC GGAT GCTATGA.C CCAACCATC T GTGGAGAGTGGAGT GCACCAGGGAC C TT 

' 

m TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 

111 

f*% ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 
ill AATAGT GAGGACCGGGGCT GAGGCCACACAGATAGGGGCCT GATGGAGGAGAGGATAGAAGT T GA 

CI ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 

AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
CTGACCAAGCATGAGAGAGATC T G TC T AT GGGACCAGTGGCT TGGATT CT GCCACACCCAT AAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
ATTTGGGGCCAAAAGTCCAGTGAAATTGTAAGCTTCAATAAAAGGATGAAACTCTGA 
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FIGURE 62 



MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SGYISMKELKQALTOCNWSSFNDETCLmiNMFDKTKSGRIDVYGFSALWKFIQQW 
DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGN I RL S FEDFVTMTASRML 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 6s 



JUL 

CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 

I 

I TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 
* ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATATTAATCGGTATCCAGGAGG 
J CAGCTGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 
GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 
CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 
Jij GGGC TAGAGCACGATAGAGGGAAACC CAACAT TGGGAGT TAGAG TCC TGCTCCCGCCCCTTGCTG 

f|| TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 
C3 T GCTCATC TCCAATAAAAT AAAAGCACTTAT GAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

W AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 



5 
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FIGURE 64 

MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDLARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPAMDSWPPEDPWQMMAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 



|.4 Important features of the protein: 

|!j Signal peptide: 

W amino acids 1-26 

CP 

III 

2J* Casein kinase II phosphorylation sites. 

jj amino acids 56-59, 155-158 

! *y 

^ N-myristoylation sites. 

^ amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 

fi 259-264, 269-274, 270-275, 280-285, 281-286, 305-310 

; ;:r--r? 

ru 

Ill 
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FIGURE 65 



AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATCGGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACAC ATGA CAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 
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FIGURE 66 

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARS QHHWCNT 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 37-41 

N-myristoylation sites. 

amino acids 15-21, 19-25, 60-66 
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FIGURE 67 



ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCC ATG GCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
GCCTGCCCATC TAGG TCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA 
AGACTCTGCCCCTGAGGTCAAGAGAGGATGGGGCTAT TCACTT TTATATAT TTATATAAAATTAG 
TAGTGAGATGTAAAAAAAAA&AAAAAAAA 
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FIGURE 68 

MANPGLGLLLALGLP FLLARWGRAWGQ I QT TSANENS TVLPSS TS S S S DGNLRPEAI TAI I WFS 
LLAALLLAVGLALLVRKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI 

Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 56-80 

N-glycosylation site. 

amino acids 36-40 

CAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 86-90 

Tyrosine kinase phosphorylation site. 

amino acids 86-94 

N-myristoylation sites. 

amino acids 7-13, 2 6-32 
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FIGURE 6Q 



GCCAGGAATAACTAGAGAGGAACA ATG GGGTTATTCAGAGGTTTTGTTTTCCTCTTAGTTCTGTGCCTGCTGCACCAG 
T CAAATACTT CCTT CATT AAG CT GAAT AATAAT GGCTT T GAAGATATTGT CATTGT TATAGAT CCTAGT GTGCCAGAA 
GAT GAAAAAAT AAT T GAACAAATAGAGGATATG GT GACTACAGCTT CTAC GTACCT GT TT GAAGC CACAGAAAAAAGA 
TTTTTTTT CAAAAATGTAT CTATATTAATTCCTGAGAATTGGAAGGAAAATCCTCAGTACAAAAGGCCAAAACATGAA 
AACCAT AAACAT GCT GAT GTTAT AGT TGCACCAC CTACACT CCCAGGTAGAGAT GAAC CATACAC CAAGCAGTTCACA 
GAAT GT GGAGAGAAAGGCGAATACAT T CACTT CACC CCT GACCTT CTACTTGGAAAAAAACAAAAT GAATATG GACCA 
C CAG GCAAACTGTT T GTCCAT GAGT GGGCT CACCT C CGGT GGG GAGTGTTT GAT GAGT ACAAT GAAGAT CAGC CTTTC 
TAC C GT GCTAAGT CAAAAAAAAT C GAAG C AACAAGGT GTT C CGCAGGT AT CTCT GGTAGAAAT AGAGTT TATAAGT GT 
C AAGGAGGCAGCT GT CTTAGTAGAGCAT GCAGAATT GATT CTACAACAAAACTGTATGGAAAAGAT TGT CAATTCTTT 
CCT GATAAAGTACAAACAGAAAAAGCAT CCATAAT GTTTAT GCAAAGTATT GATT CTGTT GTT GAATT T T GTAACGAA 
AAAACCCATAATCAAGAAGCTCCAAGCCTACAAAACATAAAGTGCAATTTTAGAAGTACATGGGAGGTGATTAGCAAT 
I i TCT GAGGAT TT TAAAAACACCATACCCAT GGT GACACCACCTCCT C CACCT GTCT T CT CATTGCT GAAGAT CAGT CAA 

^ AGAATT GT GT GCTTAGTT CTT GATAAGTCT GGAAGCAT GGGGGGTAAGGACCG C CT AAAT CGAATGAAT CAAGCAGCA 

lil AAACATTT CCTGCT GCAGACT GT T GAAAAT GGAT CCTGGGT GGGGATGGT T CACTTTGATAGTACT GCCACTATT GT A 

|«J AATAAGCTAAT CCAAATAAAAAGCAGTGAT GAAAGAAACACACT CAT GGCAGGATTACCTACATAT CCT CT GGGAGGA 

t!!! ACTTCCATCT GCT CT G GAATTAAATATGCATTTCAGGT GATTGGAGAG C TACAT T CCCAACT CGAT GGAT CCGAAGTA 

Us CTGCTGCTGACTGATGGGGAGGATAACACTGCAAGTTCTTGTATTGATGAAGTGAAACAAAGTGGGGCCATTGTTCAT 
||| TTTATT GCT TT GGGAAGAGCT GCT GATGAAGCAGTAATAGAGAT GAGCAAGATAACAGGAGGAAGT CATTT TTAT GTT 

SI T CAGATGAAGCTCAGAACAAT GGC CT CATT GAT GCTTT T GGGGCT CTTACATCAGGAAATACT GAT CTCT CCCAGAAG 

Ifl T CCCT T CAGCT CGAAAGTAAGGGAT T AACACT GAATAGTAATGC CT GGAT GAACGACACTGT CATAATT GAT AGTACA 

jj GT G G GAAAGGACACGT TCT TT CT CAT CACAT GGAACAGT C T GCCT C CCAGTATT T CTCT CTGGGAT CCCAGTGGAACA 

L ATAATGGAAAATTTCACAGTGGATGCAACTTCCAAAATGGCCTATCTCAGTATTCCAGGAACTGCAAAGGTGGGCACT 
%M& T GG GCATACAATCT T CAAGCCAAAGCGAACC CAGAAACAT T AACTATT ACAGTAACTT CTCGAGCAGCAAATT CTTCT 

„ GT G CCT C CAAT CACAGT GAAT GCTAAAAT GAATAAGGAC GTAAACAGT T TC CC CAGCCCAAT GATTGT TTACGCAGAA 

ATTCTACAAGGATATGTACCTGTTCTTGGAGCCAATGTGACTGCTTTCATTGAATCACAGAATGGACATACAGAAGTT 
TTGGAACT TTT GGATAAT GGT GCAGGCGCTGATT CTTT CAAGAATGATGGAGT CTACT C CAGGTAT T T TACAGCATAT 
||| ACAGAAAAT GGCAGATATAGCTTAAAAGTT C GGG CT CAT GGAGGAGCAAACACT GCCAGGCTAAAATTACGGCCTCCA 

CTGAATAGAGCC G CGT ACAT AC CAGGCT GGGTAGT GAACGGGGAAAT T GAAGCAAACC CGCCAAGACCT GAAATT GAT 
W GAGGATACT CAGAC CAC CT TGGAGGAT T T CAGCC GAACAGCATCCGGAGGT GCATT TGT GGTAT CACAAGT CCCAAGC 

ff J CTTCCCTTGCCTGACCAATACCCACCAAGTCAAATCACAGACCTTGATGCCACAGTTCATGAGGATAAGATTATTCTT 
jS ACATGGACAGCACCAGGAGATAATTTTGATGTTGGAAAAGTTCAACGTTATATCATAAGAATAAGTGCAAGTATTCTT 
W GATCTAAGAGACAGTTTTGATGATGCTCTTCAAGTAAATACTACTGATCTGTCACCAAAGGAGGCCAACTCCAAGGAA 
J || AGCTTT GCATT TAAACCAGAAAATAT CT CAGAAGAAAAT GCAACCCACATATTT AT T GCCATT AAAAGTAT AGATAAA 

AGCAATTTGACAT CAAAAGTATCCAACATTGCACAAGTAACTTT GT T TATC CCTCAAGCAAAT C CT GAT GACATTGAT 
C CTACACCTACT C CTACT CCTACT CCTACTC CT GATAAAAGT CATAATT CT G GAGTTAATATTT CTACGCTGGTATTG 
T CT GTGATT GGGT CTGTT GTAAT T GT TAACTTTATTT TAAGTAC CACCATT TGAA C CTT AAC GAAGAAAAAAAT CTTC 
AAGTAGAC CT AGAAGAGAGTT TTAAAAAACAAAACAATGTAAGTAAAGGAT AT TT CT GAAT CT TAAAATTCAT CCCAT 
GT GT GAT CATAAACTCATAAAAATAATTTTAAGAT GT CGGAAAAGGATACTTT GAT TAAATAAAAACACTCAT GGATA 
TGTAAAAACTGT CAAGATTAAAATT TAATAGTT T CAT TTATT TGTT AT TTTATT TGTAAGAAATAGT GATGAACAAAG 
ATCCTTTTTCATACTGATACCTGGTTGTATATTATTTGATGCAACAGTTTTCTGAAATGATATTTCAAATTGCATCAA 
GAAATTAAAAT CAT CTATCTGAGTAGTCAAAATACAAGTAAAGGAGAGCAAATAAACAACATT T GGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 7Q 

MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKASIMEMQSI'DSVVEFCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWVGMVHFDSTATIWKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIK 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
SHFYVS DEAQNNGL I DAFGALT S GNTDLSQKS LQLESKGLT LNSNAWMNDTVI IDS TVGKDT FFL 
ITWNSLPPSISLWDPSGTIMENFTVDATSKMAYLSIPGTAKVGTWAYNLQAKANPETLTITVTSR 
AANSSVPPITVNAKMNKDVNSFPSPMIVYAEILQGYVPVLGANVTAFIESQNGHTEVLELLDNGA 
GADS FKNDGVYSRYFTAYTENGRYSLKVRAHGGANTARLKLRPPLNRAAYI PGWWNGE IEANPP 
RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
FDVGKVQRYI I R I SAS I LDLRDS FDDALQVNTTDLS PKEANSKES FAFKPENI S EENATHI FI AI 
KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
VNFILSTTI 

Signal peptide: 

amino acids 1-21 

Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 



FIGURE 71 



13 



CT C CT TAGGT GGAAAC C CT GGGAGTAGAGTACT GACAGCAAAGACC GGGAAAGACCATAC GTCCC C GGGCAGGGGTGA 
CAACAGGT GT CAT CTT TTT GATCT CGT GT GT GGCT GCCT T C CTATTT CAAGGAAAGACGCCAAGGTAATTT T GACCCA 
GAG GAGCAAT GAT GTAGCCACCT CCTAAC CTTCC CTTCTT GAACCC CCAGTTAT GCCAGGATTT ACTAGAGAGT GTCA 
ACT CAACCAGCAAGC GGCTC CTT C GGCT TAACT T GT GGTT GGAGGAGAGAACCT TT GT GG GGCT G C GT TCT CT TAGCA 
GTGCTCAGAAGTGACTTGCCTGAGGGTGGACCAGAAGAAAGGAAAGGTCCCCTCTTGCTGTTGGCTGCACATCAGGAA 
GGCTGT GATGGGAAT GAAGGT GAAAACTT GGAGAT TTCACTTCAGT CATT GCT T CT GCCT GCAAGATCAT CCT TTAAA 
AGTAGAGAAGCT GCT CT GTGT GGT GGTTAACTC CAAGAGGCAGAACTCGT T CTAGAAGGAAAT GGATGCAAGCAGCT C 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTG AATGA TGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CTGGCACT GCC CAGGGCCAACAGC CCCACGGGGAAG GAGGG GTACCAG GC C GT CCT TCAGGAGT GGGAGGAGCAGCAC 
CGCAACTACGT GAGCAGCCT GAAGCGGCAGATCGCACAGCT CAAGGAGGAG CT GCAGGAGAGGAGT GAGCAGCT CAGG 
AAT GGG CAGTACCAAGCCAGC GAT GCTGCT G GCCT GGGTCT GGACAGGAGC CCCC CAGAGAAAACCCAGGCCGACCTC 
CTGGC CTT CCT GCACT CGCAGGT GGACAAGGCAGAGGTGAAT GCT GGC GT CAAGCT GG CCACAGAGTATGCAG CAGTG 
CCTTTCGATAGCTTTACTCTACAGAAGGTGTACCAGCTGGAGACTGGCCTTACCCGCCACCCCGAGGAGAAGCCTGTG 
*f AGGAAGGACAAGC GGGAT GAGTT GGT GGAAGCCAT T GAAT CAGC CTT GGAGAC CCT GAACAAT CCT GCAGAGAACAGC 

ft! CCCAAT CACC GT CCTTACAC GGCC TCTGATTTCATAGAAGGGAT CTAC CGAACAGAAAGGGACAAAGGGACATT GT AT 

lis GAGCT CAC CTT CAAAGGGGACCAGAAACACGAATT GAAAC GGCT CAT CTT ATTT CGAC CATTCAGCCCCAT CAT GAAA 

GT GAAAAAT GAAAAGCT CAACAT GGCCAACACGCTTAT CAATGT T ATCGT GCCT CTAGCAAAAAGG GT GGAC AAGTTC 
Iff C GGCAGTT CAT GCAGAATTT CAGGGAGAT GT GCATT GAGCAGGAT GGGAGAGT C CATC T CACT GTT GTTTACT TT GGG 

Wj AAAGAAGAAATAAATGAAGT CAAAGGAATACTT GAAAACACTT CCAAAGCT GCCAACT T CAGGAACTT TACCT T CATC 

CAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 
%Q TTTTTCTGT GAT GT GGACAT CTACTTCACAT CT GAATT C CT CAATACGT GT AGGCT GAAT ACACAG CCAGGGAAGAAG 

GTATTTTATCCAGTTCTTTTCAGTCAGTACAATCCTGGCATAATATACGGCCACCATGATGCAGTCCCTCCCTTGGAA 
„ CAGCAGCT GGT CATAAAGAAGGAAACTGGAT TTT GGAGAGACTTTGGAT T T GGGAT GACGT GTCAGTAT CGGT CAGAC 

13 TTCATCAATATAGGTGGGTTTGATCTGGACATCAAAGGCTGGGGCGGAGAGGATGTGCACCTTTATCGCAAGTATCTC 

CACAGCAACCTCATAGTGGTACGGACGCCTGTGCGAGGACTCTTCCACCTCTGGCATGAGAAGCGCTGCATGGACGAG 
CTGAC CCC CGAGCAGTACAAGAT GT GCAT GCAGT CCAAGGC CAT GAAC GAGGCATCCCACGGCCAGCT GG GC AT G CTG 
GTGTTCAGGCACGAGATAGAGGCT CACCTTCGCAAACAGAAACAGAAGACAAGTAGCAAAAAAACATGAACTCCCAGA 
GAAGGATT GT GGGAGACACT T TT T CTTT C CTTTT GCAATTACT GAAAGT GGCT GCAACAGAGAAAAGACTT CCATAAA 
GGAC GACAAAAGAATT GGACT GAT G GGTCAGAGAT GAGAAAGCCT C CGATTTCT CT CT GTT GGGCT TTTTACAACAGA 
|J AATCAAAATCTCCGCTTTGCCTGCAAAAGTAACCCAGTTGCACCCTGTGAAGTGTCTGACAAAGGCAGAATGCTTGTG 
p | AGATTATAAGCCTAAT GGT GT GGAGGTTTT GAT GGT GT TTACAATACACT GAGACCTGTT GTTTT GTGT GCTCAT TGA 

AATATTCATGATT T AAGAGCAGT TTT GTAAAAAATT CATTAGCAT GAAAGGCAAGCATATT T CT C CTCAT AT GAATGA 
GCCTATCAGCAGGGCTCTAGTTTCTAGGAATGCTAAAATATCAGAAGGCAGGAGAGGAGATAGGCTTATTATGATACT 
AGTGAGTACATTAAGTAAAATAAAATGGACCAGAAAAGAAAAGAAACCATAAATATCGTGTCATATTTTCCCCAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TTTT TTCCCTT GT GAGTT ATAGTCTGCT TATTTAATTACCACTTTGCAAGC CT TACAAGAGAGCACAAGTT GG C CTAC 
ATTT TTATATTTTT TAAGAAGATACT TTGAGAT GCATTATGAGAA.CTTT CAGT T CAAAG CAT CAAATT GAT GC CATAT 
CCAAGGACAT G CCAAAT GCT GATT CT GTCAGGCACT GAAT GTCAGGCAT T GAGACATAGGGAAGGAAT GGTTT GT ACT 
AATACAGACGTACAGATACTTTCTCTGAAGAGTATTTTCGAAGAGGAGCAACTGAACACTGGAGGAAAAGAAAATGAC 
ACTT T CTGCTTTACAGAAAAGGAAACT CATT CAGACTGGT GATAT CGT GAT GTACCTAAAAGT CAGAAAC CACAT TTT 
CTCCTCAGAAGTAGGGACCGCTTTCTTACCTGTTTAAATAAACCAAAGTATACCGTGTGAACCAAACAATCTCTTTTC 
AAAACAGGGTG CT CCTCCT GG CTT CT GGCTT CCATAAGAAGAAATGGAGAAAAATATAT ATATATATATATATAT TGT 
GAAAGATCAATCCATCTGCCAGAATCTAGTGGGATGGAAGTTTTTGCTACATGTTATCCACCCCAGGCCAGGTGGAAG 
TAACT GAATT ATTTTTTAAAT TAAGCAGT TCTACT CAAT CACCAAGAT GCTTCT GAAAAT T GCATTTTATTACCATTT 
CAAACT AT TTT TTAAAAATAAATACAGTTAACATAGAGT GGTTT CT TCATT CAT GT GAAAATTATTAGCCAGCACCAG 
ATGCATGAGCTAATTATCTCTTTGAGTCCTTGCTTCTGTTTGCTCACAGTAAACTCATTGTTTAAAAGCTTCAAGAAC 
ATTCAAGCTGTTGGTGTGTTAAAAAATGCATTGTATTGATTTGTACTGGTAGTTTATGAAATTTAATTAAAACACAGG 
CCAT GAAT GGAAGGT GGTATT GCACAGCTAATAAAATAT GAT TT GT GGATAT GAA 



111 
0 
III 



App_ID=l 0063549 



Page 223 of 320 



FIGURE 72 

MMMTORGLLAWISRVVVLLVLLCCAI SVLYMLACTPKGDEEQLALPRANSPTGKEGYQAVLQEWE 
EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 
AEWAGWLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA 
ENS PNHRPYTAS DFI E G I YRTERDKGTLYELTFKGDHKHE FKRL I L FRPFS P IMKVKNEKLNMAN 
TLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTVVYFGKEEINEVKGILENTSKAANFRN^ 
TFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 
NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 
KYLHSNLIVWTPTOGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHL 
RKQKQKTSSKKT 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 315-319, 324-328 

N-myristoylation sites. 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 
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FIGURE 23 

GAGACT GCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGAT TT GTCCTGGGGATCCA 
GAAACCCATGATACCCTACTGAACACCGAATCCCCTGGAAGCCCACAGAGACAGAGACAGCAAGA 
GAAGC AGAGAT AAAT ACACT CACGCCAGGAGC TCGCTCGCTCTCTCTCTCTCTCTCT CACT CC TC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
GGAAACAAT GCCCAGTCGCCCAT CGATAT TCAGACAGACAGTGTGACAT TT GACCCTGAT T T GCC 
TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
{J ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
CP GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGATCAACAG 
W TGAAGCCACATTTGCAGAGCTCCACATTGTACATTATGACTCTGATTCCTATGACAGCTTGAGTG 

m 

%i AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
.fl AATATAGCTT AT GAACACAT TC T GAGT CAC T TGCAT GAAGT CAGGCAT AAAGATCAGAAGACCTC 

s AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
*3 GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 

AT T TCAATGGAACAGC T GGAAAAGC TT CAGGGGACAT T GT T C TCCACAGAAGAGGAGCCC T C T AA 
GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
CCGAAAGAGT GTGGTCT T CACCTCAGCACAAGCCACGACT GAGGCATAAAT T CC T T CT CAGAT AC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACTGTAGGAGTAGTAAGCAGATGTCCTCCTTCCCCTGGACATCTCTTAGAGAGGAAT 
GGACCCAGGC TGTCAT T CCAGGAAGAAC T GCAGAGCC T TCAGCCTC TCCAAACATGTAGGAGGAA 
AT GAGGAAAT CGC T G T GT TGTTAAT GCAGAGANCAAAC T C T GT T TAGT TGCAGGGGAAGT T T GGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
CT CCT TAGGATAAAGAGTT GC TGT TGAAGT T GTATATT T T T GAT CAATATAT T TGGAAAT T AAAG 
TTTCTGACTTT 
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FIGURE 75 



TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TCTGGTGGT TTGCCT AAACC TGCAAACATCACCT TC T TATCCAT CAACATGAAGAATGT CC TACA 
ATGGACTCCACCAGAGGGTCTTCAAGGAGT TAAAGT TACT TACACTGTGCAGTATTTCATCACAA 
ATTGGCCCACCAGAGGTGGCACT GACTACAGATGAGAAGT CCAT T T CT GT T G TCCTGACAGC TCC 
AGAGAAGT GGAAGAGAAATCCAGAAGACC T TCC TGT T T CCATGCAACAAATATAC TCCAATC TGA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
5 . ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
« AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
Q CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
W« T CTGT GAT GGGCTAT TCCAT CT AC CGATAT ATCCAC GT TGGCAAAGAGAAACACC CAGCAAAT T T 

GATTTTGATTTATGGAAATGAATTTGACAAAAGATTCTTTGTGCCTGCTGAAAAAATCGTGATTA 
%i ACT T TATCACC C TCAATATC TCGGAT GAT T CTAAAATT TC TCAT CAGGATAT GAGT TT AC T GGGA 

■Ji AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 
« GGAAGAGGAGGT GAAACAT T TAGGGTATGCT TCGCAT T TGAT GGAAATT TT T TGT GAC TC TGAAG 

Cl AAAACACGGAAGG TAC T TC T CTCACCCAGCAAGAGTC CCT CAGCAGAACAATACCCCCGGATAAA 

III 

ACAGT CAT TGAATATGAATATGATGTCAGAACCAC T GACAT TT G T GCGGGGCC TGAAGAGCAGGA 

o 

III GCTCAGT T TGCAGGAGGAGGT GT CCACACAAGGAACAT TAT TGGAGT CGCAGGCAGCGT T GGCAG 

Q TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 
111 CAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGA 

TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCT T CT GAGGGGGAT GGGCT CGGAGAGGAGGG TC T T CTATCTAGAC TCTATGAGGAGCCGGCT 
CCAGACAGGCCACCAGGAGAAAAT GAAACCTATC TCAT GCAAT T CATGGAGGAATGGGGG T TATA 
TGTGCAGATGGAAAACTGATGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 
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FIGURE 76 



MSYNGLHQRYFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPEKWKRNPEDLPVSMQQIY 
SNLKYOTSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 
DQSSEFKAKIIFWYVLPISITVFLFSVMGYSIYRYIHVGKEKHPANLILIYGNEFDKRFFVPAEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 




Signal peptide: 



Important features: 



amino acids 1-28 



Transmembrane domain: 



amino acids 140-163 



y] 



N-glycosylation sites. 



amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 



GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAG ATG GCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCAC CCT CAGT C CCAC TGCAGT T CT CATCC TCGGCCCAAAAGT CATCAAAGAAAAGC T GAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
AT CT GGCT GAAGGTCATCACAGC TAACAT CC TC CAGC TGCAGGTGAAGC CC TCGGCCAATGACCA 
|4 GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
0 TCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCACCATCCGCATGGACACCAGTGCAAGTGGC 
Cf CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
JJj TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
III CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
4* CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
%| TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 

AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
J| TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
f% ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
f|| TGAT CAAT GAAAAGGC TGCAGATAAGC T GGGAT CTACCCAGATC GTGAAGAT CC TAAC T CAGGAC 

S3 ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 

m 

TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
TT TACACCAAAGGTGACCAACTTATACTCAACT TGAATAACATCAGCTCTGATCGGATCCAGCTG 
ATGAACTCTGGGATTGGCTGGTTCCAACCTGATGTTCTGAAAAACATCATCACTGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 
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FIGURE 78 

MAGPWT FTLLCGLLAATL I QATLS PTAVL I LGPKVT KEKLTQELKDHNAT S I LQQL PLLSAMREK 
PAGGIPVLGSLWTVLKHIIWLKVITANILQLQWPSANDQELLWIPLD^AGFNTPLVKTIVE 
FHMTTEAQATIRMDTSASGPTRLVLSDCATSHGSLRIQLLYKLSFLWALAKQVMNLLVPSLPNL 
VKNQLCPVIEASFNGMYADLLQLVKVPISLSIDRLEFDLLYPAIKGDTIQLYLGAKLLDSQGKVT 
KWFNNSAASLTMPTLDNIPFSLIVSQDVVKAAVAAVLSPEEFMVLLDSVLPESAHRLKSSIGLIN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQLILNLNNISSDRIQLMNSGIGWFQPDVLKNIITEIIHSILLPNQNGKLRSGVPVSLVKALG 
FEAAESSLTKDALVLTPASLWKPSSPVSQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site. 

amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 407-457 
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FIGURE 7Q 

GAGAGAAGTCAGCC TGGCAGAGAGACT CT GAAATGAGGGAT TAGAGG T G TT CAAGGAGCAAGAGC 
TTCAGCCT GAAGACAAGGGAGCAGT CCCT GAAGACGC T TC TAC T GAGAGGT CTGCCATGGCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCC TCT GGAT GGAATGT GCCACACACAGCACAGGCATCACC CAGT G TGACAT C TATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
Q ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
|fl CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 

3.. 3 

W CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 
*JJ CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 
,f| GTTCAATTCCTACAGCCTGACAGGGTATGTG TGAA GAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
* GGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 
13 GTCAGAAGGTGCTGCTGAGGATAGACTGACTTTGGCCATTGGATTGAGCAAAGGCAGAAATGGGG 
jjj GCTAGTGTAACAGCATGCAGGTTGAATTGCC7\AGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
Jf? TCACCTTGCTGCTCCCCTGGCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 
fii GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 
f|| TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 

GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GACT AAT T T GT GC AT GAAC T GAAATAAAAC CAT C C TAC GG TAT C CAG GGAAC AGAAAGCAGGAT G 
CAGGATGGGAGGACAGGAAGGCAGCCTGGGACATTTAAAAAAATA 
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FIGURE 80 



MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQCD 
I YS T LLGL PAD I QAAQAMMVT S S AI S S LAC 1 1 S WGMRC T VFCQE S RAKDRVAVAGGVFFI LGGL 
LGFI PVAWNLHG I LRDFYS PLVPDSMK FE I GEAL YLG IISSLFSLIAGIILCFSCSS QRNRSNYY 
DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site. 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins . 

amino acids 46-59 
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FIGURE 81 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGGGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
t. CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 
E| ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
Cl GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
J : J ! TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
lg AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 

ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 
\l) AC TGCCAGACT CC TGC TGCCAAGCC T GCAAAGAT GAGGCAAGT GAGCAAT CGGATGAAGAGGACA 

GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 
AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 
p| CAGACCCAAGGGAGCAGGCAGCACAACTGTCAAGATCGTCCTGAAGGAGAAACATAAGAAAGCCT 
ftj GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
Q CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 

CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
CTCGGACTTGGTGGAGATCTACCTCTGGAAGCTGGTAAAAGATGAGGAAACTGAGGCTCAGAGAG 
GTGAAGTACCTGGCCCAAGGCCACACAGCCAGAATCTTCCACTTGACTCAGATCAAGAAAGTCAG 
GAAGCAAGACTTCCAGAAAGAGGCACAGCACTTCCGACTGCTCGCTGGCCCCCACGAAGGTCACT 
GGAACGTCTTCCTAGCCCAGACCCTGGAGCTGAAGGTCACGGCCAGTCCAGACAAAGTGACCAAG 
ACATAACAAAGACC TAA CAGTT GCAGAT AT GAGC TGTATAATTGT TGT TAT TATATAT T AATAAA 
TAAGAAGT T GCAT TAC CCT CA&AAAAAAAAAAAAAAAAAAAA 
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FIGURE 82 



WPETOVLSSLLGLALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGIMYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
ISSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVEIYLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 
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FIGURE 82 



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACATCCATGGCTCTCATGCTCAGTTTGGTTCTGAGTC 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CC AGTT CTCT AGCGT GGT CC ACCT CT AC AG GGACGGGAAG GACC AGCC AT TT AT GC AGATGC C ACAGT ATC 
AAGGCAGGACAAAACTGGTGAAGGATTCTATTGCGGAGGGGCGCATCTCTCTGAGGCTGGAAAACATTACT 
GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 
ACAGGTGTCAGCACTGGGCTCAGTTCCTCTCATTTCCATCACGGGATATGTTGATAGAGACATCCAGCTAC 
TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 
8 AC AGAC TC CAGGAC AAAC AG AG AC AT GC AT GGCCT GTT TGAT GT GGAG AT CT CT CTGACCGT CC AAGAGAA 

.u 

II CGCCGGGAGCATATCCTGTTCCATGCGGCATGCTCATCTGAGCCGAGAGGTGGAATCCAGGGTACAGATAG 

Jjjj GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 

PI TT TT TT GGC ATT GTT GGACTGAAG ATTT TC TT CT CC AAAT TCC AGT GGAAAATCCAGGCGGAACT G GACTG 

;s«? 

I GAGAAGAAAGCACGGACAGGCAGAATTGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAG 
AGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTG 
CCTC AC TCT GAG AAGAGAT T TACAAGGAAGAGT GTGGT GGCT TCT C AGAGTT T C CAAGC AGGGAAAC ATT A 
CTGGGAGGT GGACGGAGG AC AC AAT AAAAGGTGGCG CGTGGG AGT G TGCCGGGATGATGT GGACAGGAGGA 
AGGAGTACGTGACTTTGTCTCCCGATCATGGGTACTGGGTCCTCAGACTGAATGGAGAACATTTGTATTTC 
AC AT T AAATCCC CGT TTT ATCAGCGTCT TC CCCAGGACCC C ACCT ACAAAAAT AGGGGTC TTCCTGGACT A 
TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 
AAGGCTTATT GAGGCCCT ACAT TGAGT ATC CGTC CT AT AATGAGCAAAATGGAACT CCCAT AGTC ATC TGC 
CC AGT C ACCC AGGAAT CAGAGAAAGAGGCC TCTT GGCAAAGGGCCT CT GC AAT C CCAG AGAC AAGC AACAG 
TGAGTCCTCCTCACAGGCAACCACGCCCTTCCTCCCCAGGGGTGAAATGTAGGATGAATCACATCCCACAT 
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 
GATGAAGGGGGACTGGCCTGTCCACATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGG 
CT GACATT AC AT TT AGTT T GCT CT C ACT CC ATCT GGCT AAGT GATCTT GAAAT ACC ACCT CT CAGGTGAAG 
AACC GT CAGG AATTCCCAT CTCAC AGGCTGTGGT GT AGAT TAAGT AGACAAGGAAT GT GAAT AATGCTTAG 
ATCT T ATTGATG ACAGAGT GTATCCT AATGGTTT GT TCAT T AT ATT ACACTT T C AGT AAAAAAA 



•■ •:• 
%? 5 
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FIGURE 8d 

MAMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSSVVH 
LYRDGKDQPFMQMPQYQGRTKLYKDSIAEGRISLRLENITVLDAGLYGCRISSQSYYQKAIWELQ 
VSALGSVPL I S I TGYVDRDI QLLCQS SGW FPRPTAKWKGPQGQDLS TDSRTNRDMHGL FDVE I SL 
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLSPDHGYWVLRLNGEHLYFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
PIVICPVTQESEKEASWQRASAI PETSNSESSSQATTPFLPRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 239-255 
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FIGURE 85 



AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGACATGCTGCTGCTGCTGCTGCCCCT 
GCT C T GGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAAC TGC TGACGAT GCAGAG T TCCGTGA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCACATACCAAGAAT TGCACCC TGAGCATCAGAGAT GCCAGAAGAAGTGATGCGGGGAGA 
TACT TCT TT CGTATGGAGAAAGGAAGTATAAAATGGAAT TAT AAACAT CACCGGCTC TC TGTGAA 
TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
fj AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 
gl GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 
W GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 
ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 
.« GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 
5 GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 
13 GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 

CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 
GAAC GTCT CCC T GCAGAGCAAAGCCACAT CAGGAGTGACTCAGGGGGT GGTC GGGGGAGCTGGAG 
CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 
AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCCTTGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCCACAG 
ATGAGAAACTGCAGAGACTCACCCTGATTGAGGGATCACAGCCCCTCCAGGCAAGGGAGAAGTCA 
GAGGCTGAT TCT TGTAGAAT TAACAGCCCTCAACGTGATGAGCTATGATAACACTATGAATTATG 
T GCAGAGT GAAAAGCACACAGGC T T T AGAG T CAAAGTATC TCAAACC T GAAT CCACACT GTGCCC 
T CCC T T T TAT T T T T T TAAC TAAAAGACAGACAAATT CC TA 



ru 
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FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPWHGYWFREG 
ANTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLS IRDARRSDAGRYFFRMEKGS IKWNY 
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGWGGAGATALVFLSFCVIFVWRSCRKKSARPAAGVGDTGIE 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 
YSEIKIHR 

Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 351-370 
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FIGURE 87 



AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
CCAAGGAAAGTGCAGCTGAGACTCAGACAAGATTACAATGAACCAACTCAGCTTCCTGCTGTTTC 
T CAT AGCGACCACCAGAGGATGGAGT ACAGAT GAGGC TAATAC T TAC T T CAAGGAAT GGACC TGT 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
CAACTACAACACCTTTGGATCTGCAGAGGCGGCCACGAGCGATGACTACAAGAACCCTGGCTACT 
ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
HI AGAAACAGCTCCCTGCTGAGGTACCGCACGGACACTGGCTTCCTCCAGACACTGGGACATAATCT 
JJJ GT TTGGCATCTACCAGAAATATCCAGTGAAATATGGAGAAGGAAAGTGTTGGAC TGACAACGGCC 

HI CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 
,f| GGCCAGCGGGAAT TCAC T GCGGGAT T T GT TCAGT TCAGGGTAT T TAAT AACGAGAGAGCAGCCAA 

TiK? 

B CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 

t3 GATACTTTCCAGAGGCCAGTCCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATAT 

GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 

T TGAGAGT T T T GT GGGAGGGAACCCAGACCT CT C CT C CCAACCAT GAGAT C CCAAGGAT GGAGAA 
III 

pi C AAC T TAC C C AG TAG C T AGAAT G T TAAT G G C AGAAG AG AAAAC AAT AAAT CAT AT T GAC T C AAG A 
|!| AAAAAA 



n 
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FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFGSAEAAT 
SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCWTDNGPVIPWYDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERAANALCAGMRVTGCN 
TEHHCIGGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSSSREITEAAVLLFYR 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation site. 

amino acids 163-167 

Glycosaminoglycan attachment sites. 

amino acids 74-78, 289-293 

N-myristoylation sites. 

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8Q 

CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTCTGATTACCTTCA 
TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
T TAAGACT TATATACAGT T T TAGGGGACAAT TAAAAAAAAAAAA 
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FIGURE go 

MGRVSGLVPSRFLTLLAHLWVITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VEIAGFLSGVSMFNSTQSLISIGAHCSASVALSFFIFERWECTTYWYIFVFCSALPAVTEMALFV 
TVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 
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FIGURE Qi 

CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAGATCCAACTGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAAGATCAAGACGGTG 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
CCAACACAATGCCACAGGCCAGGGAAACATCTCCATCAGCCTCGTGCCCCCCAGTAAAGCTGTAG 
AGTTCCACCAGGAACAGCAGATCTTCATCGAAGCCAAGGCCTCCAAAATCTTCAACTGCCGGATG 
GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 
CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
TCTACATCGCCTTCTACAGCACGGACTATCGGCTGGTCCAGAAGGTGTGCCCAGATTACAACTAC 
CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 
T G GAAG GACAGG C C T GCCCATG C AGGAGAC CAT C TGGACAC C GGG CAG G GAAG GG G TTGGGCCTC 
AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 
AGAAAGGGTCCCAAGTGCTGGTCCCAACCTGAAGCTGTGGAGTGACTAGATCACAGGAGCACTGG 
AGGAGGAGTGGGC TC TC TGTGCAGCCTCACAGGGCTT TGCCACGGAGCCACAGAGAGATGCTGGG 
TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
CCTTGGTTCTTGCCATCCTGAGGAAAGATAGCAACAGGGAGGGGGAGATTTCATCAGTGTGGACA 
GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCATCCTTAAGCTAAGACAGGACGATTGTGGTCCTCCCACACTAAGGCCACAGCCCATC 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 



FIGURE Q2 

MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL 
LGLIAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALMLLVTGKIVDHGNGTF 
SVHFQHNATGQGNISISLVPPSKAVEFHQEQQIFIEAKASKIFNCRMEWEKVERGRRTSLCTHDP 
AKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyl transferase 

amino acids 61-71 
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FIGURE Q3 



CGGTGGC GATGA CTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATATCTGCTGATCTTTGGAGCGTTTGTCTCTGTCTATATCCAAGAA 
ATGTT CCGAT T T GCATAT TATAAACT CTT AAAAAAAGCCAGTGAAGGT TT GAAGAGTATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
G TGGAGTAT T T T CCT T TGT GAATACC CT AT CT GACTCC T T GGGGCCAGGCACAGT GGGCAT TCAT 
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
T CCT GACCCACCTGCTGGTGTCAGCCCAGACCT TCAT AAGT TCT TAT TAT GGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCCTGAAACTCTGCCTGCTCTGCCAAGACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGAT 
AACCT CAGGGAAC CAGCACT TCCCAAACCGCAGAC TACATC TT TAGAGGAAGCACAAC T GT GCCT 
TTTTCTGAAAATCCCTTTTTCTGGTGGAATTGAGAAAGAAATAAAACTATGCAGATA 



App_ID-10063549 



Page 245 



FIGURE Q4 

MTAAVFFGCAFIAFGPALALWFTIAIEPLRIIFLIAGAFFWLVSLLISSLVWFMARVIIDNKDG 
PTQKYLLIFGAFVSVYIQEMFRFAYYKLLKKASEGLKSINPGETAPSMRLLAYVSGLGFGIMSGV 
FS FVNT LS DS LGPGT VG I HGDS PQFFL YSAFMTLVI I LLHVFWG I VFFDGCEKKKWGI LL I VLLT 
HLLVSAQTFISSYYGINLASAFIILVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site. 

amino acids 120-123 

Sodium: neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE Q* 

AATTTTTCACCAGAGTAAACTTGAGAAACCAACTGGACCTTGAGTATTGTACATTTTGCCTCGTG 
GACCCAAAGGTAGCAATCTGAAACATGAGGAGTACGATTCTACTGTTTTGTCTTCTAGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACACTACCAAACCAACAGCAGTCAAAT CAGGTCTT TCCT TCTTTAAGTCTGATACCATTAACA 
CAGAT GC TCACAC TGGGGCCAGATCT GCAT CT GT TAAATCC TGC T GCAGGAATGACACC TGG TAC 
CCAGACCCACCCAT T GACCC TGGGAGGGTT GAAT GTACAACAGCAAC T GCACCCACAT GTGT TAC 
CAAT T T T T GTCACACAAC T TGGAGCCCAGGGCAC TATCC TAAGCTCAGAGGAAT TGCCACAAATC 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
TAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGG 
GAACCCCAGCAGGCCGCC TCCCAACTCCCAGT GGCACAGATGACGAC T T T GCAGTGACCAC CCCT 
GCAGGCAT C CAAAGGAGCACACATGCCAT CGAGGAAGCCACCACAGAAT CAGCAAAT GGAAT TCA 
G TAAG CTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
AT T GAT TAT AT TAT GGAAT AGAT T GAGACACAT T GGATAG T CT T AGAAGAAATTAAT T C T T AAT T 
TACC T GAAAATAT TCT TGAAAT TT CAGAAAATATGT TC TATGTAGAGAAT CCCAAC TT TT AAAAA 
CAATAATT CAAT GGATAAAT CT GT C T T TGAAATATAACAT TATGC T GCC T GGAT GATATGCATAT 
TAAAACAT AT T T GGAAAAC TGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA?yy^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE q6 

MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LPQIFTSLIIHSLFPGGILPTSQAGANPDVQDGSLPAGGAGVNPATQGTPAGRLPTPSG 
TDDDFAVTTPAGIQRSTHAIEEATTESANGIQ 

Signal peptide: 

^ amino acids 1-16 

i' 

t?i 
III 

Us 



1:1 

a 

ru 
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FIGURE Q7 



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGCTACAGGTCACCCTGGAGATGCAGGATGGACATGTCTTGTGGGGTCCAC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
GGATGAGCCAGGCACAGCCAACTCGGATCTTCGATTCCACATCCTGAGCCAGGCTCCAGCCCAGC 
CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
ACCAGCCTTGACCACGCCCTGGAGAGGACCTACCAGCTGTTGGTACAGGTCAAGGACATGGGTGA 
2? CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 
■ Jjf CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA 
CP CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
||| T GCAGAGGGAAACC TC TACGTGACCAGAGAGC TGGACAGAGAAGCCCAGGCT GAGTACCTGCTCC 

||"| AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 

ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
*p CAGT CCACCAGGTAC T GAAG TGAC TAGACT GTCAGCAGAGGATGCAGATGCCCCCGGC TCCCCCA 

Wl ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
« CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
pj CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
pS TCGAAGTCGCAGTCACAGATATCAATGATCACGCCCCTGAGTTCATCACTTCCCAGATTGGGCCT 
Vj ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
13 CC T C GAGCCCGCCTTCCGCCT CAT GGATT T T GCCATT GAGAGGGGAGACACAGAAGGGAC T T T T G 

||| GC C T GGAT TGGGAGCCAGACTCT GGGCATGT TAGACTCAGAC TC T GCAAGAACCT CAGT TAT GAG 

pi GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 

AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
* ¥ ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 

CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
ATTTTCACCCACTGGACCATGTCAAGGAAGAAGGACCCGGATCAACCAGCAGACAGCGTGCCCCT 
GAAGGCGACTGTC TGA ATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
TCCCCTGGGAGAGAGCCCAGCACCCAAGATCCAGCAGGGGACAGGACAGAGTAGAAGCCCCTCCA 
TCT GCCCT GGGGTGGAGGCACCATCACCAT CACCAGGCATGT C T GCAGAGCC TGGACACCAAC T T 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE q8 

MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
S TSLDHALERTYQLLVQWDMGDQASGHQATATTC^ 

VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTSGSVTLGVLPLRAGQNILLLVLAMDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHEWVWQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQESYEASVPISAPAGSFLLT 
IQPSDPISRTLRFSLWDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
REHIIPVWSHNAQMWQLLVRVIVCRCNVEGQCMRKVGRMKGMPTKLSAVGILVGTLVAIGIFLI 
LI FTHWTMSRKKDPDQPADSVPLKATV 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 7 62-784 
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FIGURE qq 



Q 



GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGT CCAAGAT T C T TCCCAGGAACACAAACGT AGGAGACCCACGC TC C T GGAAGCACCAGCC T T TA 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATCTAGGACCCAGGCATC T TGC T T T CCAGCCACAAAGAGACAGATGAAGAT GCAGAAAGGAAAT G 
T TCT CCT T AT GT TT GG TC TAC T ATT GCAT T TAGAAGC TGCAACAAAT T CCAATGAGACTAGCACC 
TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCAT AGT CACCAAC T CTGAGT TCCATACAACC T C CAGTGGGAT CAGCACAGCCACCAAC TC TGAG 
T T CAGCACAGC GTCCAGTGGGAT CAG CATAGCCACCAAC TC TGAGTCCAGCACAACC TCCAGTGG 
GGCCAGCACAGCCACCAAC TC T GAGT CCAGCACACCCTCCAGT GGGGCCAGCACAGT CACCAAC T 
CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
AGT AGGGCCAGCACT GCCACCAACTCT GAGTC TAGCACAC TCT CCAGT GGGGCCAGCACAGCCAC 
CAAC T CT GAC T CCAGCACAACC TCCAGTGGGGC TAGCACAGCCACCAAC TCT GAG TCCAGCACAA 
C CT CCAGT GGGGCCAGCACAGCCAC CAACTC T GAGT CCAGCACAGTGT CCAGT AGGGC CAGCACT 
CP GCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAG 
||| AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
|«« GCACAGCCACCAACTCTGACTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAG 
%J TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 

GGCTAGCACAGCCACCAACTCTGACTCCAGCACAACCTCCAGTGGGGCCGGCACAGCCACCAACT 
\Q CTGAGTCCAGCACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACACCCTCC 
s AGTGGGGCCAACACAGCCACCAACTCTGAGTCCAGTACGACCTCCAGTGGGGCCAACACAGCCAC 
f-| CAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACAA 

CCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCTAGCACA 
GCCACCAACTC TGACT CCAGCACAACCT CCAGTGAGGC CAGCACAGCCACCAACT CTGAGT C TAG 
13 CACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACAACCTCCAGTGGGGCCA 
Fif ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA 

ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
GT GGAGT CC TAAC TGGT TC TGGAGGAGACCAGTATCATCGATAGCCAT GGAGAT GAGCGGGAGGA 
ACAGCGGGCCC TGAG CAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AATCT TGAAGAAGGTAT TCCTCACCT TTCT TGCCTT TACCAGACACTGGAAAGAGAATACTATAT 
T GCTCAT T T AGC TAAGAAATAAATACAT CT CATC TAACACACACGACAAAGAGAAGC T GTGC T TG 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CAT C T GGC AT T CAAAAT C T C CACAG T AAAAT C CAAAGACC T C AAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



if! 



7? '- 2 
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FIGURE 100 

MKMQKGNVLLMFGLLLHLEAATNSNET S TSANTGS S VI S S GAS TATNS GSS VT S S GVS TAT I SGS 
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSE'SSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSWA 
AVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI 
AMEMSGRNSGP 

Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 510-532 
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FIGURE 101 

GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGGGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCTGGAAC TATCAGGCTGAAAAACAGAGTGGGTACTCTCTT CT GGGAAGCTGGCAACAAAT 
GGATGATGTGATATATGCATTCCAGGGGAAGGGAAATTGTGGTGCTTCTGAACCCATGGTCAATT 
AACGAGGCAGT T TCTAGC TACT GCACGTACT TCATAAAGCAGGACT CT AAAAGC T TTGGAATCAT 
GGTGT CATGGAAAGGGAT TTACTT TATACTGACTCTGTTT TGGGGAAGCT TT TT TGGAAGCAT T T 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
GATTATAACTGGGGATGCATTTGTTCCTGGAGAAAGAAGTGTCATTATCATGAACCATCGGACAA 
fri GAATGGACT GGATGTTCCTGTGGAAT TGCCT GATGCGATATAGCTACCTCAGAT TGGAGAAAATT 

III TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 

CTTCATTCATAGGAAATGGAAGGATGACAAGAGCCATTTCGAAGACATGATTGATTACTTTTGTG 

*R 

AT AT T CACGAACCAC T TCAACT CCT CATAT T CCCAGAAGGGACT GAT CT CACAGAAAACAGCAAG 
TC TCGAAGTAATGCAT T TGC TGAAAAAAAT GGAC T TCAGAAATAT GAATAT GT T T TACATCCAAG 
Q AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG 
|!1 ATATCACTGTGGCGTATCCTCACAACATTCCTCAATCAGAGAAGCACCTCCTCCAAGGAGACTTT 
*** CCCAGGGAAATCCACTTTCACGTCCACCGGTATCCAATAGACACCCTCCCCACATCCAAGGAGGA 
P| CCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCTGCGTTCCTTCTATCAAG 

GGGAGAAGAATTTTTAT TT TACCGGACAGAGTGTCAT TCCACCTT GCAAGTCTGAACTCAGGGTC 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATATTTGTACAGTCTTGTTAAGTGGTATTTTATAATCACCATTGTAATCTTTGTGCTGCAAGAGA 
GAATATTTGGTGGACTGGAGATCATAGAACTTGCATGTTACCGACTTTTACACAAACAGCCACAT 
T TAAAT T CAAAGAAAAAT GAG TAA GAT TAT AAGGT T T GCCATGT GAAAAC CTAGAGCATAT TT T G 
GAAATGTTCTAAACCTTTCTAAGCTCAGATGCATTTTTGCATGACTATGTCGAATATTTCTTACT 
GCCATCATTATTTGTTAAAGATATTTTGCACTTAATTTTGTGGGAAAAATATTGCTACAATTTTT 
T T T AAT C TCTGAATGTAAT T T CGATACT GT GTACATAGCAGGGAGT GAT C GGGGTGAAATAAC T T 
GGGCCAGAATAT TATTAAACAAT CAT CAGGCT TT TAAA 



HI 
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FIGURE 102 



MHSRGREIWLLNPWSINEAVSSYCTYFIKQDSKSFGIMVSWKGIYFILTLFWGSFFGSIFMLSP 
FLPLMFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVI ITGDAFVPGERSVI IMNHRTRMDWM 
FLWNCLMRYSYLRLEKICLKASLKGVPGFGWAMQAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEP 
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 
YPHNI PQSEKHLLQGDFPRE IHFHVHRYPI DTLPTSKEDLQLWCHKRWEEKEERLRS FYQGEKNF 
Y FTGQSVI PPCKSELRVLWKLLS I LYWTLFSPAMCLL I YL YSLVKW YFI I T I VI FVLQERI FGG 
LE I I ELAC YRLLHKQPHLNSKKNE 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 
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FIGURE 1Q3 

CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 

T CCAAAT CAT C CATCCACCCCTGCT GTCAT CTGTTT TCATAGT GT GAGAT CAACCCACAGGAAT A 

TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 

CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 

TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 

CACC T CTACAGAGATGGGGAAGACT GGGAAT CTAAGCAGAT GCCACAGT AT CGAGGGAGAACTGA 

GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 

ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 

CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 

GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 

AGGAT T T GT CT T CAGACTCCAGAGCAAAT GCAGATGGGTACAGCC T GTAT GATGT GGAGATC TC C 

AT TATAGT CCAGGAAAAT GC TGGGAGCAT AT TGTGT T CCATCCACCT TGCT GAGCAGAGTCATGA 

GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 

TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 

AAAT CCAAAGGGAAAATCCAGGCGGAACT GGACT GGAGAAGAAAGCACGGACAGGCAGAAT TGAG 

AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 

T T T CT GAT CTGAAAAC TGTAACCCATAGAAAAGC TC CCCAGGAGGTGC C T CAC TC T GAGAAGAGA 

T T TACAAGGAAGAGT GTGGTGGC T TCT CAGGGT T TCCAAGCAGGGAGACAT TACT GGGAGGT GGA 

CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 

ATGTGACTTTGTCTCCCAACAATGGGTATTGGGTCCTCAGACTGACAACAGAACATTTGTATTTC 

ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 

GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 

T GACAT GTCAGT T TGAAGGC T TGT T GAGACCCTATAT CCAGCAT GCGATGTATGACGAGGAAAAG 

GGGACTCCCATATTCATATGTCCAGTGTCCTGGGGATGAGACAGAGAAGACCCTGCTTAAAGGGC 

CC CACACCACAGACCCAGACACAGCCAAGGGAGAGT GC TC CCGACAGGT GGCCCCAGCT TCC TCT 

CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 

TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 

TCAGAAGCCATGGCTGCCCTGAAGTGGGGACGGAATAGACTCACATTAGGTTTAGTTTGTGAAAA 

CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 

ACAGT GATT CCT GCC T CACAGGT GAAGAT TAAAGAGACAACGAAT GT GAAT CATGC TT GCAGGTT 

TGAGGGCACAGTGTTTGCTAATGATGTGTTTTTATATTATACATTTTCCCACCATAAACTCTGTT 

TGCTTATTCCACATTAATTTACTTTTCTCTATACCAAATCACCCATGGAATAGTTATTGAACACC 

T GCT T T GT GAGGC TCAAAGAATAAAGAGGAGGTAGGAT TT T T CACT GAT TC TATAAGCCCAGCAT 

TACC TGATACCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGTCCATATC C 

CT CAT T AACACAGACACAAAAAT T CTAAATAAAAT T T TAACAAAT TAAACTAAACAATATAT T T A 

AAGAT GAT AT ATAAC TACTCAGTGTGGT T T GTCCCACAAATGCAGAGT T GGT T TAATAT T TAAAT 

AT CAACCAGTGTAATTCAGCACAT TAATAAAGTAAAAAAGAAAACCATAAAAAAAAAAAAAAA 



FIGURE 104 

MAFVLILVLSFYELVSGQWQVTGPGKFVQALVGEDAVFSCSLFPETSAEAMEVRFFRNQFHAVVH 
LYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
VAALGS LPL I S I VGYVDGG IQLLCLS SGW FPQPTAKWKGPQGQDLSS DSRANADGYS L YDVE I S I 
IVQENAGSILCSIHI^QSHEVESKVLIGETFFQPSPWRLASILLGLLCGALCGVVMGMIIVFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQGFQAGRHYWEVDVGQOTGWYVGVCRDDVDRGKlSnWTLSPNNGYWVLRLTTEHLYFT 
FNPHFI SLPPS TPPTRVGVFLDYEGGT I S FFNTNDQS L I YTLLTCQFEGLLRPY IQHAMYDEEKG 
TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 
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FIGURE los 



CCTTCACAGGACTCTTCATTGCTGGTTGGCA ATGA TGTATCGGCCAGATGTGGTGAGGGCTAGGAAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CT CACT GT T CAT T AT GTGAGAT AT AAT C AAAAGAAG ACCT AC AATT ACT AT AGC ACATT GTC AT TT AC AAC 
T G AC AAACT ATATGCT GAGT TT GGC AGAGAGGCTT C TAAC AATT T T AC AG AAAT GAGCC AGAGACTTGAAT 
C AATGGTG AAAAAT GC AT T T TATAAATCT C C AT T AAGG GAAGAATTTGTC AAGT CTCAGGTT AT C AAGTT C 
AGT C AACAGAAGCAT GGAGT GT TG GCTC AT AT GCTGTT GATT TGT AGATTTCACTC T ACT GAGGAT CC TGA 
AACT GT AGAT AAAATT GTTC AACT TGTT T T AC ATGAAAAGCT GC AAGATGCTGT AGGACC CCCT AAAGTAG 
ATCCTC ACTC AGTT AAAAT T AAAAAAAT CAAC AAGAC AG AAACAGAC AGC TATCT AAACC AT T GCT GCGG A 
ACACGAAG AAGT AAAACTCT AGGT C AGAGT CT CAG GAT CGTT GGTGGG AC AGAAGT AGAAGAGGGT GAAT G 
GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC 
T TGT GAGTGC TGCT C AC T GTTT T AC AAC AT AT AAGAACCCTGCC AGAT GG ACT GCT TCCT TT GGAGTAAC A 
ATAAAACCTTCGAAAATGAAACGGGGTCTCCGGAGAATAATTGTCCATGAAAAATACAAACACCCATCACA 
TG AC TATGAT AT TTCT CT T GCAGAGCTT TCTAGCCC TGTT CCCT AC AC AAAT GC AGT ACAT AGAGT TT GTC 
TCCCTGATGCATCCTATGAGTTTCAACCAGGTGATGTGATGTTTGTGACAGGATTTGGAGCACTGAAAAAT 
GATGGT TACAGT C AAAAT C ATCTT CG ACAAGC AC AGGT GACT CT CAT AGACGCT AC AACT TGC AAT GAACC 
T C AAGC TT AC AAT GAGGCCAT AACT CCT AGAAT GTT AT GT GCT GGCT CCT TAGAAGG AAAAACAGATGC AT 
GCCAGGGTGACT CTGGAGGACCACTGGT TAGTTC AG AT GCT AGAGAT AT CT G GT ACCT T GCT GGAATAGTG 
AG CT GGGG AG AT GAAT GTGCGAAACCC AAC AAGC CT GGT GTT TATACT AGAGT TACGGCC T T GCGGGACTG 
GATT ACTTCAAAAACT GGT ATC TAA GAG AC AAAAGC CT CATGGAACAGAT AACATT TT TT T TTGTTTTTTG 
GGTGTGGAGGCC AT TT TT AG AG AT ACAG AATT GGAGAAGACT T GCAAAAC AGCT AG ATTT GACT GATC TCA 
ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GATC AACT CT GT CAT CTGT G AGCAAT AGTTGAAACT TT AT GT AC AT AGAGAAAT AG AT AATACAAT AT T AC 
ATTACAGCCTGTATTCATTTGTTCTCTAGAAGTTTTGTCAGAATTTTGACTTGTTGACATAAATTTGTAAT 
GCATATATACAATTTGAAGCACTCCTTTTCTTCAGTTCCTCAGCTCCTCTCATTTCAGCAAATATCCATTT 
TCAAGGT GCAGAACAAGGAGTGAAAG AAAAT AT AAGAAGAAAAAAAT CCC CT ACAT TT T ATT GGCACAGAA 
AAGT AT TAG GTGTT T T TCTT AGTGGAAT AT T AGAAATGAT CAT ATT CAT TAT GAAAGGTC AAGC AAAG AC A 
GC AGAAT ACCAATC AC TTCATCAT TT AGGAAGT ATGGGAACT AAGT T AAGGAAG TCCAGAAAGAAGCC AAG 

AT AAT AAT T AT AC AAACT T CAT GCAATGTACTT GTT CT AAGC AAAT TAAAGC AAAT AT TT ATT T AACAT T G 
TTACTGAGGATGTCAACATATAACAATAAAATATAAATCACCCA 
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MMYRPDWRARKRVCWEPWVI GLVI FI S L I VLAVC I GLT VHYVRYNQKKT YNYYS TLS FT TDKLY 

AEFGREASNNFTEMSQRLESMVKNAFYKSPLREEFVTCSQVIKFSQQKHGVLAHM 

PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 

TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTASFGVTIKPSKMKRGL 

RRIIVHEKYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS 

QNHLRQAQVTLIDATTCNEPQAYNDAITPRMLCAGSLEGKTDACQGDSGGPLVSSDARDIWYLAG 

I VSWGDECAKPNKPGVYTRVTALRDW I TSKTG I 




Transmembrane domain: 



amino acids 21-40 (type II) 



U! 



App_ID=l 0063549 



Page 258 of 320 



FIGURE 107 



AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAAT GT T CT CACT T TAACTGCAGAAACGAGGGTAGAGGAAGCAGTCAT T T T GAC T TAC TT TCCT 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GTGT AGAAC T GGCT TGTGGCGT T T GGACATATGAACAGGAACT TATGG T T CCAGT ACAATGGTCA 
GATATGG T CAC T T T GAAAGCCAGGAT GACAAAT TATGGAT TACC TAGATAT CGGTGGC T TAC T CA 
T GCT TGGAATT T T T T T CAGAGAGAGTT TAAGTGC TGT GGAGTAGTATATT T CAC T GAC TGGT T GG 
AAAT GACAGAGATGGACT GGCCCCCAGAT T CC TGCT G T GT TAGAGAAT TCCCAGGATGT TCCAAA 
CAGGCCCACCAGGAAGATCTCAGTGACCTTTATCAAGAGGGTTGTGGGAAGAAAATGTATTCCTT 
T T TGAGAGGAACCAAACAAC TGCAGG T GCT GAGGTT TC TGGGAATCTCCAT T GGGGTGACACAAA 
TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
ACAGACCAAAT GATGTCC T TGAAGAAT GACAAC TCT CAGCACCT GT CATGT CCC TCAGTAGAACT 
GTTGAAACCAAGCCTGTCAAGAATCTTTGAACACACATCCATGGCAAACAGCTTTAATACACACT 
T TGAGATGGAGGAGTTATAAAAAGAAATGTCACAGAAGAAAACCACAAACT TGTTT TATTGGACT 
T GT GAATT T T T GAGTACATACTATGT GT T T CAGAAATATGTAGAAAT AAAAATGT T GCCATAAAA 
TAACACCTAAGCATATACTATTCTATGCTTTAAAATGAGGATGGAAAAGTTTCATGTCATAAGTC 
q ACCACC T GGACAATAATTGATGCCCT TAAAAT GCTGAAGACAGAT GT CATACCCAC TG TGTAGCC 

f;| TGTGTATGACTTTTACTGAACACAGTTATGTTTTGAGGCAGCATGGTTTGATTAGCATTTCCGCA 
1% T CCAT GCAAACGAGTCACATAT GGTGGGACT GGAGCCATAGTAAAGGT T GAT T TAC T TC TACCAA 

fj CTAGTATATAAAGTAC TAAT TAAATGC TAACATAGGAAGT TAGAAAATACT AATAACTT TTAT TA 

13 CTCAGCGATCTATTCTTCTGATGCTAAATAAATTATATATCAGAAAACTTTCAATATTGGTGACT 
R| ACCTAAAT G T GAT TT T TGC TGGT TACT AAAATAT TC T TAC CACT T AAAAGAGCAAGCTAACACAT 

Q T GTC T T AAGCTGATCAGGGATT T T T T GTATATAAGTCT GT GT TAAAT C T GTATAAT TCAGT CGAT 

T TCAGT TCT GATAATGT TAAGAATAACCATT AT GAAAAGGAAAAT T T G T CCT GTATAGCAT CAT T 



m 



PI 



ATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 
TGTT AT T TAAATACT TAACCACTAATT TTGAAAATTACCAGTGTGATACATAGGAATCAT TATTC 
AGAAT GTAGTC T GGT CT TTAGGAAGTATT AATAAGAAAAT T T GCACATAAC T TAGT TGAT TCAGA 
AAGGACTTGTATGCTGTTTTTCTCCCAAATGAAGACTCTTTTTGACACTAAACACTTTTTAAAAA 
GCT TATC TT TGCC TT C TCCAAACAAGAAGCAATAGTC T CCAAGT CAATATAAAT TC TACAGAAAA 
TAGTGTTCTTTTTCTCCAGAAAAATGCTTGTGAGAATCATTAAAACATGTGACAATTTAGAGATT 
C TT TGTT T TAT T T CAC TGAT TAATAT ACT GT GGCAAAT TACACAGAT TAT TAAAT T T T T T TACAA 
GAGTATAGT ATAT T TAT T T GAAAT GGGAAAAGTGCAT T TTACTGTAT T T TG T G TAT T T T GTT TAT 
T T CT CAGAATATGGAAAGAAAAT TAAAAT GT GTCAATAAAT ATT T T C T AGAGAGTAA 
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MAREDSWCLRCLLYALNLLFWLMSISVIAV^ 

VMIAVCCFLI IVGMLGYCGTVKRNLLLLAWYFGSLLVI FCVELACGVWTYEQELMVPVQWSDMVT 
LKARMTNYGLPRYRWLTHAWNFFQREFKCCGVVYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQ 
EDLSDLYQEGCGKKMYSFLRGTKQLQVLRFLGISIGVTQILAMILTITLLWALYYDRREPGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL 

Signal peptide: 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 
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FIGURE too 



CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GAT AAAGT CTT TCTC GTGAGAGCCT AGAGGCCTT AAAAAAAAAAGT GCT T GAAAGAGAAGG GGAC AAAGGAAC A 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGATGCCTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCT GGAGGAACACT GACCACCAGTT GGATGAGTCT CAAGGTCCTCCT CT AT GT GAC AACCAT GT GAAT GGGGAG 
T GGT ACC ACTT C AC GGGCAT GGC GGGAGAT GCCAT GCCT AC CT TCT GC AT AC C AGAAAAC C ACT GT GGAACCCA 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
TCAAT GG GAAC TGCTGTCTCT GGAACAC CAC GGT GGAAGTC AAGGC T T GCCC T GGAG GCT ACT AT GT GT AT C GT 
|>»b CTGACC AAGCC CAGC GTCT GCT T CC AC GT C T ACT GT GGT CAT T T T T AT GAC AT C T GC GAC GAGGACT GCCAT GG 

13 CAGCTGC T C AGAT AC CAGC GAGT GC AC AT GC GCT C C AGGAAC T GT GC TAG GC C C T G AC AGGC AGAC AT GCT T T G 

CI AT G AAAAT GAAT GT G AGC AAAAC AAC GGT GGCT GCAGT G AGAT CT GT GT GAACCT C AAAAACT CCT AC CGCT GT 

U$ GAGT GT G GGGT T GGC C GT GT GC T AAGAAGT GAT GGC AAGAC T T GT GAAGACGT T GAAGGAT GCC AC AAT AAC AA 

m TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 

§B 

**f CT GAGGAT AAC CAC ACT T GCCAAGT CC C T GT GT T GT GCAAAT C AAAT GCCAT T GAAGT GAACAT CCCC AGGG AG 

? CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT 
ni CCTCTTCTCT C TC AAGAC AT GT GGT AC AGT G GT C GAT GT GGT GAAT G AC AAGATT GT GGC C AGC AACC T C GT GA 

^ CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 

ACCTGCGAGTTTCCACGCCTGTACACCATTTCT GAAGGATACGTTCCCAACCTTCGAAACTCCCCACTGGAAAT 
CAT GAGC C GAAAT CAT GG GAT CT TC CC AT TC ACT CT GGAGAT CT T CAAGGAC AAT G AGT T T GAAGAGC CTT ACC 
»a GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
#*° s TTGGAAAGCTT GGT GGAGAGCTGCTTTGCCACCCCCACCTCCAAGATCGACGAGGTCCT GAAAT ACT ACCTCAT 

If! CCGGGATGGCTGTGTTTCAGATGACTCGGTAAAGCAGTACACATCCCGGGATCACCTAGCAAAGCACTTCCAGG 

TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TT GGACGAGCGTTCCCGCTGTGCCCAGGGTT GCCACCGGCGAATGCGTCGT GGGGCAGGAGGAGAGGACTCAGC 
CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACTGGGAGGACTAGTTCGTAGCCATACCTC 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
T GGGT T C AGACT TC ACACT GT GAGT T C AGAC T C CC AGC AC C AACT CACT CT GAT TCT GGT CC AT TC AGT GGGCA 
CAGGTCACAGC ACT GCTGAACAATGTGGCCT GGGT GGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CC CAGAAAGAC ACTC ACC C CAT T TCCC T CAT T T CT T T CCT AC AC T T AAAT AC CT C GT GT AT GGT GC AATCAGAC 
CACAAAAT CAGAAGC TGGGT AT AAT ATTTCAAGTT AC AAACCC T AGAAAAAT T AAACAGT TACT GAAAT TAT GA 
CTT AAAT ACCC AAT GACTCCTT AAAT AT GTAAATT AT AGTT AT ACCT T GAAATTT CAATTCAAAT GC AGACT AA 
T TAT AGG GAAT T T GGAAGT GT AT C AAT AAAAC AGT AT AT AAT T TT 



in 
C; 
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MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP 
TFCIPENHCGTHAPVWLNGSHPLSGDGIVQRQACASFNGNCCLWNTTVSVKACPGGYYVYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTWDVVNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLKLRDSLYFGIEPWHV 
SGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHBCEVFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 

gwsa 

j*\ Important features of the protein: 

m Signal peptide: 

|s| amino acids 1-16 

N-glycosylation sites. 

%ll amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

P* 

Jit Tyrosine kinase phosphorylation sites. 

pi 

amino acids 411-418, 443-451 

III 

p| N-myristoylation sites. 

r|| amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 

522-528, 531-537 

Aspartic acid and asparagine hydroxylation site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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FIGURE ill 



ir ; 



G AGAGAGGC AGC AGCTTGCT CAGCGG AC AAGGAT GCTGGGCGTGAGGG AC CAAGGCCT GC CCT GC ACT CGG 
GC CT CCTCCAGCC AGT GCTGAC CAGGGAC T TCT GAC CT GCT G GCCAGCC AGGAC CT GT GTGG GGAGGCCC T 
CCTGCTGCCT TGGGGT GAC AATCT CAGCT C CAG GCT AC AGGG AGACCGGG AGGATC AC AGAGCC AGC ATGT 
TACAGGATCCTGACAGTGATCAACCTCTGAACAGCCTCGATGTCAAACCCCTGCGCAAACCCCGTATCCCC 
ATGG AG AC CTTC AGAAAGGT GGGGAT CCCC AT CAT C AT AGCACT ACTGAGCCT GGCGAGTAT CAT CAT TGT 
GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GG AAGC AGCT GT GTGACGGAGAGCT GGACT GT CCCT TGGGGGAGGACGAGGAGC ACT GTGTCAAGAGC TT C 
CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 
AGGGAACTGGTTCTCTGCCTGTTTCGACAACTTCACAGAAGCTCTCGCTGAGACAGCCTGTAGGCAGATGG 
GCTACAGCAGAGCTGTGGAGATTGGCCCAGACCAGGATCTGGATGTTGTTGAAATCACAGAAAACAGCCAG 
GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
TCAGC ATCCAGT ACGACAAACAGC AC GT CT GT GGAGGG AGCATCC TGGAC CCCC ACTGGGTCCTCACG GC A 
GCCC ACTGCT T C AGGAAAC ATACCGAT GTGTT CAACT G GAAGGTGCGGGC AGGC TC AGAC AAACTGGGCAG 
CTTCCC ATCCCT GGCT GT GGCC AAGATC AT CATC ATTG AATT CAACCC C ATGT ACC CCAAAG AC AATG AC A 
TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
GAT GAG GAGCTC ACT C CAGCCACCCC ACTCTGGATC AT TGGATGGGGCT T TACGAAGC AGAAT GGAGGGAA 
G ATGTC TG AC AT ACTGCT GC AGGC GTC AGT CC AGGT CATT GACAGC AC AC GGTGC AAT GC AGACGATGCGT 
ACCAGGGGGAAGTCACCG AG AAGATGAT GT GTGC AG GC AT CCCGGAAGGGGGTGTGGACACCTGCC AGGGT 
GAC AGT GGT GGGCCCC TG ATGT ACCAAT CT GACC AG TGGC AT GT GGTGGGC AT C GTT AGC TGGGGCT ATGG 
CTGCGGGGGC CC GAGC ACCCCAGG AGTAT AC ACC AAGGTCTC AGC C TATCTC AACT GGAT CTAC AATGTCT 
GGAAGGCTGAGCT GTAA TGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGAT CCCC C AAAGT CAGACACAGAGCAAGAGT CCCCT T GGGT AC AC CCCT CT GC C CAC AGCC TCAGCAT 
TTCTTGGAGCAGCAAAGGGCCTCAATTCCTGTAAGAGACCCTCGCAGCCCAGAGGCGCCCAGAGGAAGTCA 
GC AGCC CT AGCT CGGC CACACT TGGT GCTC CCAGCATCCC AGGGAG AG AC ACAGCCC ACT GAAC AAGGTCT 
CAGGGGTATT GCT AAGCC AAGAAGGAACTT T CC CAC ACT ACT GAAT GG AAGCAGGCTGT CTT GT AAAAGCC 
C AGAT C ACTGTGGGCT GGAGAGGAGAAGGAAAGG GT CT GCGC CAGCCCT GTCCGTCTT C ACCCAT CCC CAA 
GCCT AC T AGAGC AAGAAACC AGTT GT AAT AT AAAAT GCAC TGC CCT ACTGT TGGTAT GAC TACCGT T ACCT 
ACTGTTGTCATTGTTATTACAGCTATGGCCACTATTATTAAAGAGCTGTGTAACATCTCTGGCAAAAAAAA 
AAAA 
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FIGURE 112 

MLQDPDSDQPLNSLDVKPLRKPRIPMETFRKVGIPI I IALLSLAS 1 1 IVWLIKVILDKYYFLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEALAET ACRQMGYS RAVE I GPDQDLDWE I TENS QELRMRNS SGPCLS GS LVS LHCLACGKSL 
KTPRWGGEEASVDSWPWQVS IQYDKQHVCGGS ILDPHWVLTT^AHCFRKHTDVFNWKVRAGSDKL 
GSFPSLAVAKI I I IEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWI IGWG 
FTKQNGGKMS D I LLQAS VQVI DS TRCNADDAYQGEVTEKMMCAGI PEGGVDTCQGDS GGP LM YQS 
DQWHWGI VSWGYGCGGPS TPGVYTKVSAYLNV? I YNWKAEL 

Transmembrane domain: 

amino acids 32-53 (typell) 
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FIGURE 113 



GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 
TGAACTGCTAGGATTCTGACT ATGC TGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCC CAAACCT GAC C T TCACTC TGGAACGAGAACAGAGG T T TC TACCCACACCGT CCCC TC GAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
IZ, CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
P\ AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
P GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 
ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
CAGAAGGGGTCTGCTTAGACCACCTGGTTTATGTGACAGGACTTGCATTCTCCTGGAACATGAGG 
GAACGCCGGAGGAAAGCAAAGT GGCAGGGAAGGAAC T TG T GCCAAAT TAT GGGT CAGAAAAGAT G 
GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 
CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
CCACGT AC CAGC TGC TGAAGGGCAAC TGCAGGCCGATGCT C TCAT CAGCCAGGCAGCAGCCAAAA 
TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
CTCCTTCCCTCTGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGG 
GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
G T GCGAACACGGAAAT GCC TCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGAC TGTG T GG 
CTCAATTTAAATCATGTTCTAGTAATTGGAGCTGTCCCCAAGACCAAAGGAGCTAGAGCTTGGTT 
CAAAT GAT C TCCAAGGGCCCTTATACCCCAGGAGAC T T TGATT TGAAT T T GAAACCCCAAATCCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
CC AGC TAC T CGGGAGGCTGAGACAGGAGAAT TAC TT GAACCTGGGAGGTGAAGGAGGC T GAGACA 
GGAGAAT CACT TCAGCCTGAGCAACACAGCGAGACTC TGT CTCAGAAAAAATAAAAAAAGAAT TA 
TGGTTATTTGTAA 



III 

P 



TPS 1 

fii 
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FIGURE lid 



MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 

amino acids 1-15 



m 



3-: 

O 

a?* 
•as? 

?II 
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FIGURE lis 



C AG CAGT G G T C T C T CAGT C C T C T CAAAGC AAGGAAAGAG T AC TGTGTGCT GAGAGAC CATGGC AA 
AGAATCC TCCAGAGAATTG TGAAGACT GT CACAT TC TAAATGCAGAAGCT T T TAAAT CCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
G TT T T GGGGGAGCAAGCAC T TC T GGC CGGAGGTACC CAAAAAAGCCTATGACATGGAGCACACT T 
T CTACAGCAATGGAGAGAAGAAGAAGAT T T ACAT GGAAAT T GAT CCTGTGACCAGAAC TGAAATA 
T TCAGAAGCGGAAATGGCACT GATGAAACAT T GGAAG TGCACGAC T T T AAAAACGGATACAC T GG 
CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
CTGAACCAGAAGAGGAAATAGATGAGAATGAAGAAAT TACCACAACT TTCT TTGAACAGTCAGTG 
iaf AT TT GGG T CCCAGCAGAAAAGCCTAT T GAAAACCGAGAT T T TC T TAAAAAT T CCAAAAT TC T GGA 

GATTTGTGATAACGTGACCATGTATTGGATCAATCCCACTCTAATATCAGTTTCTGAGTTACAAG 
yl ACT T T GAGGAGGAGGGAGAAGAT CT TCAC T TTCC TGCCAACGAAAAAAAAGGGAT TGAACAAAAT 

Iff GAACAGT GGGT GG TCCCTCAAGT GAAAGTAGAGAAGACCC G TCACGCCAGACAAGCAAGTGAGGA 

4* AGAAC TT CCAATAAAT GAC TAT AC T GAAAAT GGAATAGAAT T T GATCCCAT GCTGGATGAGAGAG 

*W GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
L GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 
Iff TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTC TAA TAGGAGGTTTGAGCTCAAATGCTTAAAC 
Q TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 
ill GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 

#*! 

St T TAT C AC C AAAAAAAAAAAAAAAAAAA 
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FIGURE 116 



MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAYDME 
HT FYSNGEKKK I YME IDP VTRTE I FRS GNGTDETLEVHDFKNGYTG I YFVGLQKC FIKTQI KVI P 
E FSEPEEE I DENEE ITTT FFEQSVI WVPAEKP IENRDFLKNSKI LE I CDNVTMYWINPTLI S VSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPC3MWWVARMLGRV 



Important features of the protein: 
H* Signal peptide: 

amino acids 1-40 

W 
HI 

la Transmembrane domain: 

Iff amino acids 25-47 (type II) 

T '- < N-glycosylation sites. 

#«* amino acids 94-97, 180-183 



3 y 



Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 



Microbodies C-terminal targeting signal. 

amino acids 315-317 



Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 



GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATC ATG TCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCT GC AT CGCGGCC ACC GGGATGGACAT GTGGAGC ACCC AGGACC TGT ACGACAACC CCGT CACCT 
CC GT GTTCC AGT ACGAAGGGCT CT GGAGGAGC TGCGTG AGGC AGAGT T CAGGCT TC ACCGAAT GCAGGCCC 
TATTTCACCATCCTGGGACTTCCAGCCATGCTGCAGGCAGTGCGAGCCCTGATGATCGTAGGCATCGTCCT 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTG 
TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 
GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
TCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACTACAAA 
GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
||| T GGGTC CAAC AC CAAAAAC AAGAAGAT AT ACG AT GGAGGT GCCC GC ACAGAGGACG AGGT AC AATCTT ATC 

||1 CTTCCAAGCACGACTATGTGTAATGCTCTAAGACCTCTCAGCACGGGCGGAAGAAACTCCCGGAGAGCTCA 

CCCAAAAAACAAGG AGAT CCCATCT AGATT TCTT CTTGCT T T T G ACTC AC AGCT GGAAGT TAGAAAAGCCT 
CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 
GCTGAGTT ATTT AT GAAT T AGAGGCT AT AG CTC AC ATTTT C AAT C C TCT ATT T C TT T T TT T AAATATAACT 
T TCT ACTCT GAT GAGAGAATGTGGTTTT AATCTCTC TCTCAC AT T T TGATGAT T T AGACAGACT CCCC CTC 

ill 5 

»±L TTCCTCCTAGTCAATAAACCCATTGATGATCTATTTCCCAGCTTATCCCCAAGAAAACTTTTGAAAGGAAA 

i-J 

ns GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA 

HI 

j*% AC ACT T ACTG AAGAAGAAGC AATAAGAGAAAG AT AT TTGT AATCTCTCCAGCCC AT GAT C TCGGT TTT CTT 

f|| ACACTGTGATCT T AAAAGTT ACCAAACC AAAGT C AT TTT C AGTTTGAGGC AACC AAACCT TT CT AC TGCTG 

TTGACAT CTT CTTAT T ACAGC AAC ACCATT CT AGGAGTTT CC TG AGCT CT C C ACTGGAGT CCTCTT TCTGT 
CG CGGGTC AG AAATT GTCCCT AGAT GAATGAG AAAATT AT TT T TTT T AATT T AAGT CCT AAATAT AGT T AA 
AATAAATAAT GT TT TAGT AAAATG AT ACACT ATCTC TGT G AAAT AGCCT C ACCC CT AC AT GT GGAT AGAAG 
GAAATGAAAAAAT AAT TGCTTT GACAT T GT CT AT AT GGT ACT T TGT AAAGTC AT GCTT AAGT AC AAAT TCC 
AT GAAAAGCT CACACC TGT AATCCT AGC AC TTTGGG AGGCTGAGGAGGAAGGAT C ACT TGAGCC CAGAAGT 
TCGAGACT AGCCTGGGCAAC ATGGAGAAGC CCT GTCT CT AC AAAAT ACAGAG AGAAAAAATC AGCCAGTCA 
T GGT GG CAT ACACCTGT AGT CCCAGC AT TC CGGGAGGCTGAGGT GGGAGGAT CACTT G AGCCC AGG GAGGT 
T GGGGCTGC AGT GAGC C ATGAT CACACC AC TGCACT CC AG CCAGGT GACAT AGC GAGATC CTGT CT AAAAA 
AATAAAAAATAAATAATGGAACACAGCAAGTCCTAGGAAGTAGGTTAAAACTAATTCTTTAA 
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FIGURE 118 



MSTTTCQWAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP 
YFTILGLPAMLQAVRMjMIVGIVLG 

CAI AGVS VFANMLVTNFWMS TANMYTGMGGMVQTVQTR YT FGAAL FVGWVAGGL TL I GGVMMC I A 
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 
V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 
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FIGURE HQ 



GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGTGCACGGGGGACAATGAGAAGGTGAAGGCTCACATTCTGCTGACGGCTGGAATCATCTTCAT 
CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
ACTCAATAGTGAATGTTGCCCAAAAACGTGAGCTTGGAGAAGCTCTCTACTTAGGATGGACCACGGCA 
CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
j!| CAGATACT CGATACCTT CCCATCGCACAACCCAAAAAAGTTAT CACACCGGAAAGAAGTCACCGAGCG 

rn TCTACTCCAGAAGTCAGTAT GTGTAGTTGTGTATGT T TTT T TAACTT TACTAT AAAGCCATGCAAATG 

*P ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 
tfl AAT CT TAATTACAGGAACTGTGCATCAGCTAT TTAT GATTCTATAAGCTATT T CAGCAGAATGAGATA 

TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 
CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC 
ACGACATAGCATTAT GTACATAGATGAGT GTAACAT TTATATCT CACATAGAGACAT GCT TATAT GGT 
m T TTAT TTAAAATGAAAT GCCAGTCCAT TACACTGAATAAATAGAACTCAACTATTGCTTT TCAGGGAA 



9 

O 
Ill 



ATCATGGATAGGGTTGAAGAAGGTTACTATTAATTGTTTAAAAACAGCTTAGGGATTAATGTCCTCCA 



sir a 

1 1| TTTAT AAT GAAGATTAAAAT GAAGGCT TTAATCAGCATT GTAAAGGAAATT GAAT GGCT T TCTGATAT 

GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTTGTGTATTAAATTAACATTTTTAAAACGCAGATATTTTGTCAAGGGGCTTTGCATTCAAACTGCTT 
TTCCAGGGCTATACTCAGAAGAAAGATAAAAGTGTGATCTAAGAAAAAGTGATGGTTTTAGGAAAGTG 
AAAATATTTTTGTTTTTGTATTTGAAGAAGAATGATGCATTTTGACAAGAAAT CATATAT GTATGGAT 
ATAT T TTAATAAGTATT TGAGTACAGACT TT GAGGTT TC AT CAATATAAATAAAAGAGCAGAAAAATA 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TAT ACAT T TATAT TAAT AAAT TGTACATTTTTCTAATT 
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FIGURE 120 

MATHALE I AGLFLGGVGMVGTVAVTVMPQWRVS AFI ENN I WFENFWE GLWMNCVRQANI RMQCK 
IYDSLLALSPDLQAARGLMCAASVMS F]^FMMAILGMKCTRCTGDNEKVKAHILLTAGIIFIITG 
MWLI PVSWVANAI I RDFYNS I VNVAQKRELGEALYLGWT TALVL I VGGAL FCCVFCCNEKS SSY 
RYSIPSHRTTQKSYHTGKKSPSVYSRSQYV 

Signal peptide: 
amino acids 1-17 

Transmembrane domains: 

amino acids 82-101, 118-145, 164-188 
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FIGURE 121 



GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GAT T CAAAGGAGAAAAGGGGGAATG TC TGAGGGAAAGCT T T GAGGAG T CCTGGACACC CAAC TAC 
AAGCAGT GT TCATGGAGT T CAT T GAAT TATGGCATAGATCT TGGGAAAAT T GCGGAGT G TACAT T 
Q TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA 

GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 
ATT GAAGC T ATAAT T TAT T TGGACCAAGGAAGCCCTGAAAT GAAT T CAAC AATTAAT AT TCATCG 
CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 



|y 

in 

T T GGCACT T GT TCAGAT TACCCAAAAGGAGATGC TT CTAC T GGAT GGAAT T CAGT T TC TCGCATC 
tfl AT TAT T GAAGAACT AC CAAAATAAAT GCT T TAAT TT TCAT T TGC TACCTCTT TT TT TAT TATGCC 

L T T GGAAT GGT TCAC T TAAAT GACAT T T T AAATAAGT T T ATG TAT ACAT C TGAATGAAAAGCAAAG 

T % 

u5 C TAAAT AT G T T T AC AGAC C AAAG T G T GAT T T C ACAC T G T T T T TAAAT C T AGCAT TAT T CAT T T T G 

PI CTTCAATCAAAAGTGGTTTCAATATTTTTTTTAGTTGGTTAGAATACTTTCTTCATAGTCACATT 
ftj CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 

AAAAAATATAAAAGC TACCAAT C T T T GTACAAT T TGTAAATGT T AAGAAT t T T T T T TATATC TGT 
T AAATAAAAAT TAT T T CCAACA 



jfss 
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FIGURE 122 

MRPQGPAASPQRLRGLLLLLLLQLPAPSSASEIPKGKQKAQLRQREWDLYNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRIIIEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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FIGURE 123 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
ATTGCCTGGAAGAATACATCATGTTTTTCGATAAGAAGAAATTGTAGGATCCAGTTTTTTTTTTA 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCCCTAAGCCAAAGCAAAAGACCTAAGGACGACCTTTGAACAATACAAAGGATGGGTTTCAATG 
TAATTAGGCTACTGAGCGGATCAGCTGTAGCACTGGTTATAGCCCCCACTGTCTTACTGACAATG 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
lZ ATAACAGCCTTCAAAAACT TAAGTATAATCAAT TTAAAGGGCTCAACCAGCTCACCTGGCTATAC 

!;f CT TGACCATAACCATATCAGCAATATTGACGAAAATGCTTT TAAT GGAATACGCAGACTCAAAGA 

P GCTGAT T CT TAGT TCCAATAGAAT C T CCT AT TT TC T TAACAAT ACCT T CAGACC TGTGACAAATT 

|f! TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
ll'll CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
SJ AGACTGCCGCAACCTGGAACTTTTGGACCTGGGATATAACCGGATCCGAAGTTTAGCCAGGAATG 

T CT T T GCTGGCATGAT CAGACTCAAAGAACT TCACC T GGAGCACAATCAAT T TTCCAAGCTCAAC 
4 s CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
\U CATAGGACAGACCAT GTCCT GGACCT GGAGCTC C TTACAAAGGC T TGAT T TATCAGGCAATGAGA 

- TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
pi TCCAACAAGCTCACATTTATTGGTCAAGAGATTTTGGATTCTTGGATATCCCTCAATGACATCAG 

TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
III TTAAAGGTCTAAGGGAGAATACAATTATCTGTGCCAGTCCCAAAGAGCTGCAAGGAGTAAATGTG 
0 AT CGAT GCAGTGAAGAAC TACAGCAT CT GT GGCAAAAGTAC TACAGAGAGGT T T GATCTGGCCAG 

HI GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
f i TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 

TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GGCACAGGAAAAAGAAAAGACAGTCCC TAAAGCAAATGAC TCCCAGCACCCAGGAAT T T TAT GT A 
GATT AT AAACC CACCAACACGGAGACCAGCGAGATGC TGC TGAAT GGGACGGGACCC TGCAC C T A 
T AAC AAAT C GGGC TC CAG GGAG T G T GAGG T ATGAA C CAT T G T GAT AAAAAG AGC T C T TAAAAGC T 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GGT CATT T T CC TC TCATACATAAT CAACC CAT TG AAAT T TAAATACCACAAT CAATGT GAAGCTT 
GAACT CCGG TT TAAT ATAATACC TAT T GTATAAGACCCT T TACTGAT T CCAT TAATGT C GCAT TT 
G T T T T AAG AT AAAAC T T C T T T CAT AG G T AAAAAAAAAAA 



Hi 
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FIGURE 124 

MGFNVI RLLS GS AVALVI APT VLLTMLS SAERGCPKGCRCEGKMVYCE SQKLQE I PS S I SAGCLG 
LSLRYNSLQKLKYNQFKGLNQLTWLYLDHNHISNIDENAFNGIRRLKELILSSNRISYFLNNTFR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 
IJ^^FAGMIRLKELHLEHNQFSKLNIAiFPRLVSLQNLYLQWNKISVIGQTMSWTWSSLQRLDL 
SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 
WLKS FKGLRENT 1 1 CAS PKELQGVNVT DAVKNYS I CGKS TTERFDLARALPKPT FKPKL PRPKHE 
SKPPLPPTVGATEPGPETDAD7\EHISFHKIIAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of" the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

c&MP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 465-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 125 



o 

III 



CCGTTATCGTCTTGCGCTACTGCTGAATGTCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
AT TGAAGAGGAAGGC T TTC TAAAGCT TTGGCAAGGAGTGACACC CGCCAT T TACAGACACGTAGT 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
T T AGCCAATCCAACT GACC TAGT GAAGGT T CAGATGCAAAT GGAAGGAAAAAGGAAAC T GGAAGG 
AAAACCATT GCGATT TCGTGGTGTACATCATGCATTTGCAAAAATCT TAGCTGAAGGAGGAATAC 
GAGGGCTTTGGGCAGGCTGGGTACCCAATATACAAAGAGCAGCACTGGTGAATATGGGAGATTTA 
ACCACTTATGATACAGTGAAACACTACTTGGTATTGAATACACCACTTGAGGACAATATCATGAC 
TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
TCAAAAGCAGAATAAT GAATCAACCACGAGATAAACAAGGAAGGGGAC T T T T GTATAAAT CATCG 
ACTGACTGCTTGATTCAGGCTGTTCAAGGTGAAGGATTCATGAGTCTATATAAAGGCTTTTTACC 
ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 
T GAGT GGAGTCAGT CCAT T T TAA 



AppJD=10063549 



Page 277 of 320 



FIGURE 126 



MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHVVYSGGRMVTYEHLREWFGKSEDEHYPLWKS 
VIGGMMAGVIGQFLANPTDLVKVQMQMEGKRKLEGKPLRFRGVHHAFAKI LAEGG IRGLWAGWVP 
NIQRAALVmGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains: 




amino acids 25-38, 130-147, 233-248 
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FIGURE 127 

CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAAT GAAGAGGACATCCT CTCCAT GT TCT CAGC TATCC GT TCT CAGCACAGCGGTGTAGACATC 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 
AGGAGCGGAATGTGGACGATGGGCACATCATTAACATCAATAGCATGTCTGGCCACCGAGTGTTA 
CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 
AGACACAAT T CGCCT T CAAACT CCACGACAAGGACCC T GAGAAGGCAGCT GCCACC TAT GAGCAA 
ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
CATCCAGATTGGAGACATCCAGATGAGGCCCACGGAGCAGGTGACCTAGTGACTGTGGGAGCTCC 
TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
CACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAATTTGTTTGAGATTTTTATA 
TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACAC CAGGGCC TGGCCCAGT GGATT T CATGG TGATCAT TAAAAAAGAAAAAT CGCAACCAA 
AAAAAAAAAA 
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FIGURE 128 

MARPGMERWRDRLALVTGASGGIGAAVA^ 

PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGIARPDTLLSGSTSGW 

EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
I S PGWE TQFAFKLHDKDPEKAAAT YEQMKCLKPEDVAEAVI YVLS T PAHI QI GD I QMRPTEQVT 



Important features of the protein: 
Signal peptide: 

amino acids 1-17 



01 N-myristoylation sites. 

J*;J amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 

%l 199-205 

»!•?, 

s Short-chain alcohol dehyrogenase . 

13 amino acids 30-42, 104-114 

111 



m 
■ ■■ ■■■ 
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FIGURE 12Q 



AACTTCTAC ATGG GCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGT TGAGTGCTTGAAATGAGGAACTGAGAAAATTAATT TC TCATGTATTTTTCTCAT TTAT TTA 
T TAAT TT TTAACTGATAGTTGTACATATT TGGGGGTACATGTGATAT TTGGATACATGTATACAA 
T AT AT AAT GAT CAAATCAGGGTAACTGGGAT AT CCAT CACATCAAACAT T TAT T T TT TAT TCTTT 
O TTAGACAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCCATCTCAGCTTACTGCAAC 
CP CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
U? GCACCACAATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG 
*ji CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
J% GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
;5 ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
CJ ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 



fi 



'■sr ? 



fU 



AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG 
AGT AAGAAAAT GCAATAT T T GT CTT TC TGT GCC T GGC TTAT TT CACT TAACATAATGAC T TCCTG 
T TCCATCCATGT T GCT GCAAAT GACAGGATT TCGT TC TTAATT TCAAT T AAAATAACCACACAT G 
GCAAAAA 
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FIGURE ISO 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGES PMPFKRVFCQDGNVRS FCVCAVH FS SHQPPVAVECLK 



Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

*«l amino acids 86-92 

pj Zinc carboxypeptidases , zinc-binding region 2 signature. 

IH amino acids 68-79 

m 
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FIGURE 131 



T TCT GAAGT AAC GG AAGC T ACCTT GT AT AAAGAC CT CAAC ACTGCTGACC ATGATC AGCGC AGCCT GGAGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
GAAT AC CAGAGGATGC T AC AACT CTCT ACCT T CAGAAC AACCAAAT AAAT AATGCT GGGATT CCTT C AGAT 
T T GAAAAACT TGCT GAAAGT AGAAAGAAT AT AC C T ATACC ACAAC AGT TT AG AT GAATTT CCT ACCAACC T 
CCCAAAGTATGTAAAAGAGTTACATTTGCAAGAAAATAACATAAGGACTATCACTTATGATTCACTTTCAA 
AAAT TC CCTATC TGGAAGAATT AC AT TT AGAT GAC AACT CT GTC TCTGCAGT T AGC AT AG AAG AGGGAGC A 
TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
* GCCCAGGACT AT AG AAGAACTACGCTTGGATGAT AATCGC AT AT CC ACT ATTT C AT CACC AT CT CT TC AAG 

#*i GTCT CACT AGTCT AAAACGCCT GGTTCT AGAT GGAAAC CT GT TGAACAAT CATGGT TT AGGT GAC AAAGT T 

ffk TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 
||| TCC AGG CACAAACCTGAGGAAGCTTT AT CT T C AAGAT AAC CACATC AATC GGGT GC C CCC AAATGCTTTTT 

111 CTTATCTAAGGCAGCTCTATCGACTGGATATGTCCAATAATAACCTAAGTAATTTACCTCAGGGTATCTTT 

r 

GATGATTTGGACAATATAACACAACTGATTCTTCGCAACAATCCCTGGTATTGCGGGTGCAAGATGAAATG 
^ GGT ACGT GAC TGGTT ACAAT CACT ACCT GT GAAGGT CAACGT GC GT GGGC TC AT GT GCCAAGCCC C AGAAA 

AGGT TC GTGGGAT GGC TAT T AAGGATCT CAAT GC AG AACT GT TT GATT GTAAGG AC AGT GGGATT GTAAGC 
AC CATT C AGATAACCACT GCAATACCC AAC AC AGTG TAT CCT GC CCAAGG AC AGTGGCC AGCTCCAGTGAC 



1 m 

CAAACAGCCAGAT AT T AAGAACCCCAAG CT C ACT AAGGAT CAAC AAACC ACAGGGAGTCCCTCAAG AAAAA 
III CAAT TAC AAT TACTGT GAAGTCTGT C ACCT CTGAT ACCATTC AT AT CT CTTGGAAACTTGCT CTACCT ATG 

$*% ACTGCTTTGAGACTCAGCTGGCTTAAACTGGGCCATAGCCCGGCATTTGGATCTATAACAGAAACAATTGT 
fll AACAGGGG AACGCAGT GAGT AC TT GGTC AC AGCCCT GG AGCCTG AT TCACCCT AT AAAGT AT GC AT GGTTC 

CCATGGAAACCAGCAACCTCTACCTATTTGATGAAACTCCTGTTTGTATTGAGACTGAAACTGCACCCCTT 
CG AAT GTACAAC CCT ACAACC ACCCTC AAT CG AGAGC AAG AGAAAGAACCTT AC AAAAACCCCAAT TT AC C 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
TT C AT AGG AATGGATCGCT CTT CT C AAGGAACT G TGCAT ATAGC AAAG GGAGGAGAAGAAAGGATGACT AT 
GC AGAAGCT GGC AC TAAG AAGG AC AACTCT ATCC TGGAAATCAGGGAAACT T CT TT TC AG AT GT T ACC AAT 
AAGC AAT GAACCCATCTCGAAGGAGG AGTT TGT AAT ACAC ACCATATT T CCT CCT AAT GGAAT GAATCTGT 
ACAAAAACAATCACAGTGAAAGCAGTAGTAACCGAAGCTACAGAGACAGTGGTATTCCAGACTCAGATCAC 
TCACACTCATGATGCTGAAGGACTCACAGCAGACTTGTGTTTTGGGTTTTTTAAACCTAAGGGAGGTGATG 
GT 
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FIGURE 132 

MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAGIPSDLKMLLBCVERIYLYHNSLDEFPTNLPKYWELHLQENNIRTITYDSLSKIPYL 
EELHLDDNSVSAVSIEEGAFRDSNYLRLLFLSRNHLSTIPWGLPRTIEELRLDDNRISTISSPSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSNNNLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWTODWLQSLPV 
KVlSr^GLMCQAPEKTOGMAIKDLNAELFDCKDSGIVSTIQITTAIPNTVYPAQGQWPAPVTKQPD 
IKNPKLTKDQQTTGSPSRKT I T I TVKSVTSDT IHI SWKLALPMTALRLSWLKLGHS PAFGS I TET 
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
PYKNPNLPLAAIIGGAVALVTIAIjLA^ 

ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 

Important features of the protein: 
Signal peptide: 

amino acids 1-28 
Transmembrane domain: 
amino acids 531-552 
N-glycosylation sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
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FIGURE 133 

CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGT C T GGGC T TGGCCAGAGCTCAGGGT GCTGAGC GT G T GACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
CTCCACTACCTCAAGCTGAGTGATCCCAAGTACCTAAGAGAGTTCCAGCTGACCCTCCAGCCCGG 
GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 
GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
GACCACTCCAACAGAGCCAGGACTATATCAACCTCTTCTGCGCCAACATGATGGACTTGAACCGC 
AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 
TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
AACAGCAGGAAGGATGCTTCGGGGAGCCT GATGCTGAAGATGAAGAATTAT CTAAAGCTAT TCAA 
TATCAGCAGCATT TT TCGAGGAGAGTGAAGAGGCGAGAAAAACAATT TCCAGAT TCTCGCTCTGT 
TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 
GCT GAAAAGC AC T T TCAAAGAGACTGTGTT GAATAAAGGGCC AAGGT T C TT GCCACCCAGCAC TC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GTGGCTTCCTATACATCCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACCG 
CCACCACCAAGCAGCCGCTGAGACGGACGGTTCCATGCCAGCTGCCTGGAGGAGGAACAGACCCC 
T T TAGTCC T CATC CC T TAGATCC TGGAGGGCACGGATCACAT CCT GGGAAGAAGGCAT C TGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATTVAAGTTCAACTGCAACTGAAAAAAAAAAA 
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FIGURE 134 

MSARGRWEGGGRRACRGSLGIARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD 
TAESKATIADLILSALERATVFLEQRLPEINLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCANMMDLNRRAEAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 39-56 

Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-myristoylation sites. 

amino acids 10-16, 20-26, 63-69, 208-214 

Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1 . 

amino acids 230-237 
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FIGURE 135 

GGTCTGAGTGCAGAGCTGCTGTCATGGCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGCTTACAGATTTGATCCCGTTCGAGTGGATATCACTTCGA2\AGGAAAAATGAGAGCAAGATATG 
T GAAT T ACAT CAAAACATCAGAGGTT GT CAGAC TGCCCTATCCT CT CCAAAT GAAATC TT CAGGT 
CCAC CTT CT TAC T T TAT TAAAAGGGAAT CGT GGGGC T GGACAGACT T T CTAATGAACCCAAT GGT 
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
C TGACAT GAGACGGGAAAT GGAGCAGTCAATGAATAT GCTGAAT T CCAACCATGAGT T GCC TGAT 
GTTTCTGAGTTCATGACAAGACTCTTCTCTTCAAAATCATCTGGCAAATCTAGCAGCGGCAGCAG 
TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGGTAGTCAGGCCGTCCAGAGCTGGCATTTGCAC 
AAACACGGCAACACTGGGTGGCATCCAAGTCTTGGAAAACCGTGTGAAGCAACTACTATAAACTT 
GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 

AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 
GAGGTCAATAT TGATGTCACTGAATTAAT TACAGTGTCCTATAGAAAATGCCATTAATAAAT TAT 
ATGAACTACTATACATTATGTATAT TAAT TAAAACATCT TAAT CCAGAAAT CAAAAAAAAAAAAA 
AAAAAAAAAAAAAAA 
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FIGURE 136 

MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAWPGVKPQDWISAA 
RVLVDGEEHVGFLKTDGSFWHDIPSGSYWEWSPAYRFDPVRVDITSKGKMRARYVNYIKTSE 
VVRLPYPLQMKSSGPPSYFIKRESWGWTDFLMNPM^^ 
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 



Important features of* the protein: 
Signal sequence: 

!•£ amino acids 1-23 

rt 

Transmembrane domain: 

CP 

r~ a amino acids 161-182 

UJ 

if! 

*|* N-glycosylation site. 

%l amino acids 184-187 

^5 Glycosaminoglycan attachment sites. 

in amino acids 37-40, 236-239 

l»f cAMP- and cGMP-dependent protein kinase phosphorylation site. 

*** amino acids 151-154 



N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 



Amidation site. 

amino acids 238-241 



ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 137 



GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
H AAGCT GT GTGAT CGCCACAAACC T TCAGGAAATACGAAATGGAT TTT CT GAGATACGGGGCAGT G 

T GCAAGCCAAAGAT GGAAACAT T GACAT CAGAAT C T TAAGGAGGACT GAGT CT T TGCAAGACACA 
Si AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
fjj AAAC TACCAGACCCC TGACCATTAT ACTC T CCGGAAGAT CAGCAGCCTCGCCAATTCCT TT CT TA 

111 CCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACATGCCATTGTGGGGAGGAAGCAATG 
*JJ AAGAAAT ACAGCCAGATT CTGAGTCAC T T TGAAAAGC TGGAACC T CAGGCAGCAGT TGT GAAGGC 

T TT GGGGGAAC TAGACAT TC TT CTGCAAT GGAT GGAGGAGACAGAATAGGAGGAAAGTGAT GC TG 

5S — — — — 

Pi CTGCTAAGAATATTCGAGGTCAAGAGCTCCAGTCTTCAATACCTGCAGAGGAGGCATGACCCCAA 

111 ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 

"3 C TTCCT T GCATGAT TGTC T T TATGCATCCCCAATCT TAAT TGAGACCATACTTGTATAAGAT TTT 



ru 



T G TAAT AT CTTTCTGC TAT T GGAT AT AT T TAT TAG T TAAT AT AT T TAT T TAT T T T T T G C TAT T T A 
ATGTATTTATTTTTT TACT TGGACAT GAAACT TTAAAAAAATTCACAGATTATAT TTATAACCTG 
ACTAGAGCAGGT GATGTAT TTT TATACAGTAAAAAAAAAAAACCT TG TAAATTC TAGAAGAG TGG 
CTAGGGGGGTTATTCATTTGTATTCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGAT 
ATTTGAAATTGAACCAATGACTACTTAGGATGGGTTGTGGAATAAGTTTTGATGTGGAATTGCAC 
ATCTACCTTACAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAG 
CCAGGAATCCTACACGGCCAGCATGTATTTCTACAAATAAAGTTTTCTTTGCATACCAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 128 

MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 

SSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 

FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 

SIANSFLTIKKDLRLSHAHMTCHCGEEAM^ 

E 

Important features of the protein: 
Signal peptide: 

amino acids 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

N -myr i s toyl a t i on s i tes . 

amino acids 42-47, 46-51, 136-141 
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FIGURE 13Q 



m 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGATGCCTTGAGAGATGATTTTGTGTTTG 
GGTCAAAGGGTGTGAAATTTATGCCCTACACAACTTACCTTGTGGAAAAAGGAGCATCTCACAGT 
TTTGTGGCTGAAGCAAAGCCACCTACAGTTACTATGCCTCGAATCAAGGCATTGATGACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
TAAGACAAGCAAAAGCAGC TGGAAAAAGAATAGTC T T TTAT GGAGATGAAAC C T GGGT TAAATT A 
TTCCCAAAGCAT TTTGTGGAATATGATGGAACAACCTCATT TTT CGTGTCAGAT TACACAGAGGT 
GGATAATAATGTCACGAGGCATTTGGATAAAGTATTAAAAAGAGGAGATTGGGACATATTAATCC 
TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
AAGC T GAGCGAGATGGACAGCGT GC T GAT GAAGATCCACACCT CACT GCAGTCGAAGGAGAGAGA 
GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 
CCCGGTGATATCCGACATCCAAAGCACGTCCA ATAGA CGGATGTGGCTGCGACACTGGCGATAGC 
ACT T GGC T TACCGAT TCCAAAAGACAGTGTAGGGAGCC T CCTAT TC CCAGT T GTGGAAGGAAGAC 
III CAATGAGAGAGCAGT TGAGAT TT TTACAT TTGAATACAGTGCAGCTTAGTAAACTGT TGCAAGAG 

£ AAT GT GCCGTCATAT GAAAAAGATCC TGGG TT T GAGCAG T T TAAAAT GT CAGAAAGAT TGCATGG 

1| GAACT GGAT CAGACTGTAC TT GGAGGAAAAGCAT TCAGAAGTCCTAT T CAACCT GGGC TCCAAGG 

*k TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
* _ TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
© CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
|f| CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
&% GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 

AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
0 CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 
||| TCGGACAGCCTCCCAGCAGAGGTGTGGGAGCTGCAGCTGAGGGAAGAAGAGACAATCGGCCTGGA 

CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
AC T CAGAT CCACAGAGCC CAGGAT CAAGG GACCCAC T GCAGTGGCAGCAGGACT GTT GGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
CTGACCGAGACACTCACAGCTTTGTCATCAGGGCACAGGCTTCCTCGGAGCCAGGATGATCTGTG 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
CTGCACACAGTATGTAGTTACCAAAAGAATAAACGGCAATAATTGAGAAAAAAAA 
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FIGURE 14Q 

MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SKWIVLIDALRDDFVFGSKGWFMPYTTYLVEKGASHSFVAEAKPPTVTMPRIKALMTGSLPGF 
VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWVKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMSETGSHGASSTEEVNTPLILISSAFERKPGDIRHPKHVQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-34 

Transmembrane domain: 

amino acids 58-7 6 

N-glycosylation sites. 

amino acids 56-60, 194-198 

N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 

Arnidation site. 

amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 
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FIGURE 141 



GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
TCCCTATAGAAAACAACTGCCAGCACCTTAAGACCACTCACACCTTCAGAGTGAAGAACTTAAAC 
CCGAAGAAAT T CAGCAT TCATGACCAGGATCACAAAGTACTGGTCC T GGACTC TGGGAATC T CAT 
AGCAGTT CCAGATAAAAACTACATACGCCCAGAGATC TT C T TT GCAT TAGCCTCATCCT TGAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
TGTGACAAGGATAAAGGACAAAGTCATCCATCCCTTCAGCTGAAGAAGGAGAAACTGATGAAGCT 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
GT T GGGGT GACAGATAAAT T TGAGAACAGGAAACACAT TGAAT T T T CATT T CAACCAGT T T GCAA 



J?; AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA 
S! AT T T G AAC T AAT T G T AT AAAAAC AC C AAAC C T G C T C AC T 



3. 

R K5 



ni 



if, a 
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FIGURE 142 

MLLLLLE YNFP I ENNCQHLKTTHT FRVKNLNPKKFS I HDQDHKVLVLDS GNL I AVPDKNY I RPE I 
FFALASSLSSASAEKGSPILLGVSKGEFCLYCDKDKGQSHPSLQLKKEKLMKLAAQKESARRPFI 
FYRAQVGSWNMLESAAHPGWFICTSCNCNEPVGVTDKFENRKHIEFSFQPVCKAEMSPSEVSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-myristoylation site. 

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182 
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FIGURE 143 



CTAGAGAGTATAGGGCAGAAGGATGGCAGATGAGTGACTCCACATCCAGAGCTGCCTCCCTTTAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGTGGAGT T TGCTGT CCACACAT T CAACCAACAGAGCAAGGAC TACT AT GCCTACAGACT GGGG 
U C ACAT C T T G AAT TCC T GGAAGG AGCAGGTGGAGT CCAAGAC TGT AT T C T CAAT GGAGC TACTGC T 

U GGGGAGAAC T AGGTGT GGGAAAT T TGAAGAC GACAT T GACAAC T GCCAT T TCCAAGAAAGCACAG 

AGCT GAACAATAC T T T CACC TGC T TC T T CACCAT CAGCACCAGGCCC T GGAT GACTCAGT TCAGC 
CT CC TGAACAAGACC T GCT T GGAGGGATT CCAC TGAGT GAAACC CAC T CACAGGC T TGTCCATGT 
||! GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 

TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
ACAC AT C T T GAGCCTAATC AT GT AGT GTAGATCAT TAAACATC AGCAT T T T AAGAAAAAAAAAAA 
L AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 1AA 

MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHEQRDCDEHNVMARYLPATVEFAVHTFNQQSKDY 
YAYRLGH I LNS WKEQVESKT VFSMELLLGRTRCGKFEDD IDNCH FQE STELNNT FTC FFT I S TRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites. 

amino acids 117-121, 139-143 

N-myristoylation site. 

amino acids 9-15 
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FIGURE 14* 



CT GT GC AGC T CGAGGC TCCAGAGGCAC AC T CCAGAGAGAGC CAAGGT T CT GACGCGATGAGGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAATCAAGTGGAACCGGAAGGCCCTGCCCAGCACTGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCTVAGCAAGGCCGCAAGCTCGACATT 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
C CCAGGCGGCGAACCAGGGGGAG T TCCAGAAGC CAGACAACAAGC TCCACCAGCAGGT GC TC TGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
Q TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
P AAGCTTGCCAGGAGGCTGGCAGTACAGAGCGCAGCAGCGAGCAAATCCTGGCAAGTGACCCAGCT 

CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT 
GAGAAGAACACATCAGGCACTGCGCCACCTGCTTCACAGTACTTCCCAACAACTCTTAGAGGTAG 
GTGTAT T CCCGT T T TACAGATAAGGAAACT GAGGCC CAGAGAGC T GAAG TAC TGCACCCAGCATC 
ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 



j*% GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 
rii CT CTAATGAAAT T GT GAAAGCT CCAT GT T TAGAAATAAATGAAAACACCTGA 
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FIGURE 146 

MRKHLS WWWLATVCMLL FSHLSAVQTRG IKHRIKWNRKALP S TAQ I TEAQVAENRPGAF IKQGRK 
LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEAWTKEAFVTGCINATQAANQGEFQKPDNKLHQQ 
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWLMVK 



if** 



Important features of the protein: 
Signal peptide: 

amino acids 1-2 6 

Transmembrane domain: 

amino acids 157-171 



Iff N-glycosylation sites. 

,g amino acids 98-102, 110-114 



Tyrosine kinase phosphorylation site. 

amino acids 76-83 



j|| N-myristoylation sites. 

O amino acids 71-77, 88-94, 93-99, 107-113, 154-160 

ni 

Amidation site. 

amino acids 62-66 



App_ID=10063549 



Page 298 of 320 



FIGURE 147 

GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTCATGTTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAACTTTCAGATTCAGGGGGTACATGTGAAGGTTTGTTTTATGAGTATATTGCATGATGCTGAGG 
TTTGGGGT 
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FIGURE 148 



MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 

Important features of the protein: 
Signal peptide : 

amino acids 1-25 

N-myristoylation site. 

amino acids 62-68 
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FIGURE 14Q 



GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAATCTCTACATAATAGAGAATTAAGTGCAGAAAGACCTTTGAATGAACAGATTGCTGAA 
GCAGAAGAAGACAAGATTAAAAAAACATATCCT CCAGAAAACAAGCCAGGT CAGAGCAACTAT TCTTT 
TGTTGATAACTTGAACCTGCTAAAGGCAATAACAGAAAAGGAAAAAATTGAGAAAGAAAGACAATCTA 
TAAGAAGCTCCCCACTTGATAATAAGTTGAATGTGGAAGATGTTGATTCAACCAAGAATCGAAAACTG 
ATCGATGATTATGACT CTACTAAGAGTGGAT TGGAT CAT AAAT TTCAAGATGATCCAGAT GGT CT TCA 
TCAACTAGACGGGACTCCTT TAACCGCT GAAGACAT TGT CCATAAAATCGCT GCCAGGAT TTATGAAG 
7VAAATGACAGAGCCGTGTTTGACAAGATTGTTTCTAAACTACTTAATCTCGGCCTTATCACAGAAAGC 
CAAGCACATACACTGGAAGATGAAGTAGCAGAGGTT TTACAAAAATTAATCT CAAAGGAAGCCAACAA 
T TAT GAGGAGGATCCCAATAAGCC CACAAGCTGGACTGAGAATCAGGCT GGAAAAATACCAGAGAAAG 
T GACT CCAATGGCAGCAATT CAAGATGGT CTTGCTAAGGGAGAAAAC GATGAAACAGTAT CTAACACA 
T TAACCTT GACAAAT GGCTT GGAAAGGAGAACTAAAACCTACAGTGAAGACAACTTT GAGGAACT CCA 
ATAT TTC CCAAATTT CT ATGCGCT ACT GAAAAGTATT GAT T CAGAAAAAGAAGCAAAAGAGAAAGAAA 
CACTGAT TACTAT CATGAAAACACTGAT TGACT TTGT GAAGATGATGGT GAAATATGGAACAATATCT 
CCAGAAGAAGGTGTTTCCTACCTTGAAAACTTGGATGAAATGATTGCTCTTCAGACCAAAAACAAGCT 
AGAAAAAAATGCTACTGACAATATAAGCAAGCT TTTC CCAGCACCAT CAGAGAAGAGTCATGAAGAAA 
C^GACAGTACC^GGAAGAAGCAGCTAAGATGGAAAAGGAATATGGAAGCTTGAAGGATTCCACAA^ 
GATGATAACTCCAACCCAGGAGGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCCTATTTGGAAGC 
CAT C AGAAAAAAT AT TGAAT GGT T GAAGAAAC AT GAC AAAAAGGGAAAT AAAG AAGATTAT GACCT T T 
CAAAGATGAGAGACTTCATCAATAAACAAGCTGATGCTTATGTGGAGAAAGGCATCCTTGACAAGGAA 
GAAGCCGAGGCCATCAAGCGCATTTATAGCAGCCTGTAAAAATGGC^AAAGATCCAGGAGTCTTTCAA 
CTGTTTCAGAAAA(^TAATATAGCTTAAAACACTTCTAATTCTGTGATTAAAATTTTTTGACCCAAGG 
GTTAT TAGAAAGTGCTGAAT TTACAGTAGTTAACCT TTT ACAAGTGGTTAAAACATAGCT TTCTT CC C 
GTAAAAACTATCTGAAAGTAAAGTTGTATGTAAGCTGAAAAAAAAAAAAAAAAAAAA 
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FIGURE ISO 

MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYSFVDNLNLLKAITEKEKIEKERQSIRSSPLDNKLNVEDVDSTKNRKLIDDYDSTKSGLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKTYSEDNFEELQYFPNFYALLKSIDSEKEAKEKETLITIMKTLIDFVKMMVKYGTISPEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 

220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-myristoylation sites: 
amino acids 143-148, 239-244 
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FIGURE igi 



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
GATGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAATAACCAGC T T C TAGC TGGAGGGC TGCATGCAGGGAAGGT CAT TAAAGGTGAAGAGAT CAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTAACACTAGAGCCAGTGAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 
AGCAGTGTGAC TAG GGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGT GGAGGAGACCCATGGCGGACAAT CAC T CTCTC T GCTC TCAGGAC CCCCAC GTCTGAC T TAG 
TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
GGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA 
GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
GAAAT GGCTCGAGCTCAGAAGAT AAAAGATAAGTAGGG TAT GCT GAT C C TC T TT TAAAAACCCAA 
GATACAAT CAAAATCCCAGATGCT GGT C T C TAT T CCCATGAAAAAGT GC TCATGACATATTGAGA 
AGACC TACT TACAAAGTGGCATATAT T GCAAT TTAT T T TAATT AAAAGATACCTAT TTATATATT 
TCTT TATAGAAAAAAG T C T GGAAGAG T T TACT T CAAT TGTAGCAAT GT CAGGGTGGTGGCAGTAT 
AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 
AT TCC T TGTATAAAAATAAGAAAAGAAAT TAATC T TGAGGTAAGCAGAGCAGACAT CAT C T C TGA 
TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
T T GTAGTAGTGATCAGGAAACAGAT C T CAGCAAAGCCACTGAGGAGGAGGC T GT GC TGAGT T TGT 
GT GGC T GGAATC TCT GGGTAAGGAACT TAAAGAACAAAAATCATCTGG TAAT TC T T T CC TAGAAG 
GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTT^GAAGGCTGCTGTACTGGTTGA 
ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
G T CT C TGCAGAT G TAGT TAGT TAAGACAAGGTCATGC TGGATGAAGGTAGAC CTAAAT TCAATAT 
GACT GGT T TCC T TGTATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGACTATG T A 
AAGAT GAAGGCAGAGATCGGAGT T T TGCAGCCACAAGC TAAGAAACACCAAGGAT TGT GGCAACC 
AT CAGAAGC T TGGAAGAGGCAAAGAAGAAT TCT TCCC TAGAGGCT TT AGAGGGATAACGGC TC T G 
CTGAAACCTTAATCTCAGACTTCCAGCCTCCTGAACGAAGAAAGAATAAATTTCGGCTGTTTTAA 
GCCACCAAGGATAAT TGGTTACAGCAGCT CTAGGAAACTAATACAGCTGCTAAAATGATCCCTGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAAT AT GGCAGAAGTGATGGCAT GCCACT TCCAAGAT TAGGT TATAAAAGACAC TGCAGC T T C 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCTAGCTGCCATGCTATGAGCAGGCCTATAAAGAGACTTACGTGGTAAAAAATGAAGTCTCCT 
GC CCACAGCCACAT TAGTGAACC TAGAAGCAGAGAC TC TGTGAGATAATCGATG T TTGTTGTTTT 
AAGT T G CT CAGT TT TGGTCTAAC T T GT TAT GCAGCAATAGATAAATAATAT GCAGAGAAAGAG 
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FIGURE 152 

MVLSGALCFRMKDSALKVLYLHNNQLLAGGLHAGKVIKGEEISWPNRWLDASLSPVILGVQGGS 
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAPITDFYFQQCD 

N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

Interleukin-1 signature. 

amino acids 111-131 

Interleukin-1 proteins. 

amino acids 8-29, 83-120, 95-134, 64-103 
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FIGURE lfra 

CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTGTCTGCAATGGCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
C AG C C T TAT AT G CAGGAG GTGGTGCCCTTCCT GG CCAGGC T CAG C AACAG G CT AAGCACAT G TCA 
? , TATTGAAGGTGATGACCTGCATATCCAGAGGAATGTGCAAAAGCTGAAGGACACAGTGAAAAAGC 
h% TTGGAGAGAGTGGAGAGATCAAAGCAATTGGAGAACTGGATTTGCTGTTTATGTCTCTGAGAAAT 
Q GCC T GCATT TGACCAGAGCAAAGCT GAAAAATGAATAACTAACCCCC TT TCCC T GCTAGAAA.TAA 

CO CAAT T AGATGCCCCAAAGCGATT TT T T TT AACCAAAAGGAAGAT GGGAAGCCAAACT CCATCAT G 

ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 
1» AGACCAGAAGGTAGAC TT T C TAAGCATAGAT AT T TAT TGATAACAT T T CAT TGTAAC TGGT GT T C 

T AT AC AC AG AAAA.C AAT T TAT T T T T T AAAT AAT TGTCTTTTTC C AT AAAAAAG AT T AC T T T C CAT 
n TCCTTTAGGGGAAAAAACCCCTAAATAGCTTCATGTTTCCATAATCAGTACTTTATATTTATAAA 
O T GT AT T TAT TAT TAT T ATAAGACT GC AT T T TAT T TATATCATT T TAT T AAT AT GGAT T TAT T TAT 

|J| AGAAACATCATTCGATAT TGCTACT TGAGTGTAAGGCTAATATTGATATTTATGACAATAAT TAT 

AGAGCTATAACATGT T TAT T TGACCT CAAT AAACAC T TGGATAT CCC 

Ill 
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FIGURE 154 

MAALQKS VS S FLMGTLATSCLLLLALLVQGGAAAP I S SHCRLDKSNFQQPY I TNRT FMLAKEASL 
ADNNTDWLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEWPFLARLSNRLS 
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 

N-myr i s toyl a ti on s i tes . 

amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 
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FIGURE 155 



GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGATTAGGTGAGGACAGTTCTCTCATTAGCCTTTTCCTACAGGTGGTTGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
fa GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
Q CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
W TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 

GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 

III 

sg» AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
J;; GTGATGGGCTAGCCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 

ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
35 CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG 

pi 

!!! CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 

II! 

GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
GCCCAGCACAGGCAC TT T C TAGATAT T TCCCCC T TGC TGGAGAAGAAAGAGCCCC TGG T TT TAT T 
TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
T T GACAT GGAT GAT T C TGAG GAG GAAG C T G T TAT T GAAT G T AT AGAGAT T TAT C C AAATAAATAT 
CTTTATTTAAAAATGAAAAA 



1,1 

I II 

m 
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FIGURE 1*6 

MRERPRLGEDSSLISLFLQWAFLflMVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site. 

amino acids 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 127-135 

N-myristoylation sites. 

amino acids 44-50, 150-156 
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FIGURE 157 

CCGGCGATGTCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGAC T TGAGGGACC TCCGAGTAGAACCT GT TACAAC TAG T GT T GCAACAGGGGAC TAT TCA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GATT T GT GTGAC GGGCAAAAGCAAC T TCCAGTCC TACAGCTGT GTGAGGTGCAAT TACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGCTGAACACAGTC TATT TCATTGGGGCCCATAATAT TCC TAATGCAAATATGAATGAAGATGG 
U CCCT T CCAT GTC TGT GAAT T TCACCT CACCAGGCTGC C TAGACCACATAAT GAAATATAAAAAAA 

Q AGTGT GT CAAGGCCGGAAGCCT GTGGGAT CCGAACAT CAC TGC T T GTAAGAAGAATGAGGAGACA 

O GTAGAAGTGAACTTCACAACCACTCCCGTGGGAAACAGATACATGGCTCTTATCCAACACAGCAC 

fl TATCATCGGGT TTTC TCAGGTGT TTGAGCCACACCAGAAGAAACAAACGCGAGCT TCAGTGGTGA 

III 

83* TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 

>'. 

w g AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
\U GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
* CATGGGTGCTGGTGGCAGGGATCTATCTAATGTGGAGGCACGAAAGGATCAAGAAGACTTCCTTT 

T CT ACCACCACACTAC TGCCCC CCAT TAAGGT T C TT G TGG T TTACCCAT C T GAAATATGTT TCCA 
%t TCACACAATTTGTTACTTCACTGAATTTCTTCAAAACCATTGCAGAAGTGAGGTCATCCTTGAAA 
fll AGTGGCAGAAAAAGAAAATAGCAGAGATGGGTCCAGTGCAGTGGCTTGCCACTCAAAAGAAGGCA 
C) GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
*** GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 

GTGATCTAAGAAGCCAGATTCATCTGCACAAATACGTGGTGGTCTACTTTAGAGAGATTGATACA 
AAAGACGAT TACAATGCTC T CAGTGT CT GCCCCAAGTACCACC T CAT GAAGGAT GCCACTGC T TT 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTG TAG 
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FIGURE is8 

MSLVLLSLAALCRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPWTSVATGDYSIIMST^SWV 
LRADAS I RLLKATKICVTGKSN FQSY SCVRCNYT E AFQTQTRPSGGKWT FSY IGFP VELNTVY FIGAHNI P 
NAl^EDGPSMSVNFTSPGCLDHIMKYKKKCVPCAGSLWDPNITACKKNEETVEWFTTTPLGNRYMALIQH 
ST I IGFSQVFEPHQKKQTRASVVI PVTGDSEGATVQLTPY FPTCGSDC IRHKGT WLCPQTGVP FPLDNNK 
SKPGGWLPLLLLSLLVATWVLVAGIYLMWRHERIKKTSFSTTTLLPPIKVLVVYPSEICFHHTICY FTEFL 
QNHCRSEVILEKWQKKKIAEMGPVQWLATQKKAADKWFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA 
FNL FCSDLRSQI HLHKYVWY FRE I DTKDDYNAL S VCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 
GCCSL 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

Transmembrane domain: 

amino acids 290-309 

N-glycosylation sites. 

amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 
- 287 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 
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FIGURE 15Q 



ij 

ry 



AGCCACCAGCGCAACATGAC AG T GAAGACCCT GCAT GGCCCAGCCATGGT CAAGTACT TGC TGC T 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAGTAAGAGGTGCATATCC 
ACT CAGCT GAAGAAG 
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FIGURE 160 

MTWTLHGPAMVKYLLLSILGLAFLSEAAARKIPKVGHTFFQKPESCPPVPGGSMKLDIGIINEN 
QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEWQAQCRNLGCINAQGKEDISMNSVPIQQETLW 
RRKHQGCS VS FQLEKVLVT VGCTCVT PVI HHVQ 

Signal sequence: 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 

N-myristoylation sites. 

amino acids 106-111, 136-141 
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FIGURE 161 



ACACT GGCCAAACAAAAACGAAAGCACTCCG TGC TGGAAGTAGGAGGAGAGT CAGGACT CCCAGG 
ACAGAGAGTGCACAAACTACCCAGCACAGCCCCCTCCGCCCCCTCTGGAGGCTGAAGAGGGATTC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTG'CCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCCTGAAGATGAGGAAAAGT TT GGAGGAGCAGCTGACTCAGGGGT GGAGGAGCCTAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
G TGGC TGTACT TCAC TACCCAGCAAAGCC T CCACGAGGGCAGCT C GCCT TGGAGAGTAC T TAC T A 
U? CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
O CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
f|| CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
n CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
|j* CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
?W CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 

CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACTTAAATAAAGGCAGACGCTGTTTTTCTAAAAAAA 
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FIGURE 162 
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MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQKETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQWLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSVVYE)CFEAALGSEVRIWSYTQPRYEKEI^HTQQLPALPWLNVSADGD^fVHLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVLEFPLLKGHPNLCVQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQL WDDDLG AL WACPMDKY I H KRWALVWLAC LL FAAAL S L I LLLKKDHAKGWL RLL KQD VRSGAAARG 
RZUUL.LLYSADDSGFERLVGALASALCQLPLRVAVDLWSRRELSAQGPVAWFHAQRRQTLQEGGVVVLLFSP 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
L P SQL P D FLG ALQQ PRAP RSGRLQ E RAE Q V S RAL Q P AL D S Y F H P PGT P APGRG VGPGAG P GAGDGT 

Signal sequence: 
amino acids 1-20 

Transmembrane domain. 

amino acids 453-475 



HI N-glycosylation sites. 

a 



amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site. 

amino acids 583-58 6 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 



GGGAGGGCTCTGTGCCAGCCCCG ATGA GGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 

GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 

AAACATCCTGACGTGGGACAGCGGGCCAGAGGGCACCCCAGACACGGTCTACAGCATCGAGTATA 

AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 

AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 

GTCAGT GCGGGAGGCCGGT CAGCCACCAAGATGAC TGACAGGT T CAGCTC T CT GC AGCACACTAC 

CCTCAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGATTCAGATGATTGTTCATCCTA 

CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 

T TC TACCACTTAGAGC T CCAGG T CAACC GCACCTACCAAATGCACCT T GGAGGGAAGCAGAGAGA 

ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 

CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 

TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 

CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 

TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 

AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 

TCCACAGCGGCATAGCCTGTCCGAGATCACCTAGTTAGGGCAGCCAGACATCTCCATCCTCCAGC 

CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 

GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 

GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 

CCTAT GGGGTATGCATGGAAGG T TCT GGCAAAGACT CCCCCAC T GGGACAC TTT C TAG TCC TAAA 

CACCTTAGGCCTAAAGGTCAGCTTCAGAAAGAGCCACCAGCTGGAAGCTGCATGTTAGGTGGCCT 

TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 

C C C T G G GG AT T T G C AC AGAC AG AAC AT C T G AC C C AAAT G T G C T AC AC AG T G G GG AGGAAGG GAC A 

CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 

CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 

TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 

GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 

AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 

ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 

AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 

AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 

CAAGGCAGAAATGACAGTGCAAGGAGGAAATGCAGGGAAACT CC CGAGGT CCAGAGCCCCACCT C 

CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 

ACAATCTAGCTCGACAGAGCAT GAGGCCCCTGCC TC T TCTGTCAT TGT TCAAAGGTGGGAAGAGA 

GCCTGGAAAAGAACCAGGCCTGGAAAAGAACCAGAAGGAGGCTGGGCAGAACCAGAACAACCTGC 

ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 

TTCCCAGCCAGGGCAACTGCCTGACGTTGCACGATTTCAGCTTCATTCCTCTGATAGAACAAAGC 

GAAAT GCAGGT CCACCAGGGAGGGAGACACACAAGCC T TT T CTGCAGGCAGGAGT T TCAGACCCT 

ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 

TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 

C T CACCAC T CACAAGC TGTGTGAC T T CAAACAAATGAAAT CAGT GCCCAGAACC T CGGT TT CC TC 

ATCTGTAATGTGGGGATCATAACACCTACCTCATGGAGTTGTGGTGAAGATGAAATGAAGTCATG 

TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 

AAAAAAAA 
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FIGURE i6a 

MRTLLTILTVGSI^AHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRS I QMI VHP TPTP I RAGDGHRLT LED I FHDL FYHLELQVNRT YQMHLGGKQRE YE FFGL T PDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYSFSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNS LNVQRVLT FQPLRF I QEHVL I PVFDL S G P S S LAQPYQ YS Q I RVS G PRE PAGAPQRHS L S E I T 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence, 
amino acids 1-17 

Transmembrane domain. 

amino acids 233-250 

N-glycosylation sites. 

amino acids 80-83, 87-90, 172-175 

N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE 165 

TGGCCTACTGGAAAAAAAAAAAAAAAAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACAAC ATG G 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTG7\AGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGTGACGATGTATATGTCTACTACAAACTGGCAGGGGGATCCCTTGAACTTTGGGCTGGA 
AGTGT TGAACACAGT TT TGGATAT T T TCCAAAAGAT T TGAT CAAGGTACT T CATAAATACACGGA 
AGAAGAGCTACATAT TCCAGCAGATGAGACAGACT TTGTCTGCTTTGAAGGAGGAAGAGATGAT T 
T TAAT AGTTATAAT GTAGAAGAGCT T TTAGGATC T T TGGAACT GGAGGACT CT GTACCTGAAGAG 
TCGAAGAAAGCTGAAGAAGTTTCTCAGCACAGAGAGAAATCTCCTGAGGAGTCTCGGGGGCGTGA 
ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT 
CAGAGAGCACCGAGGGGCTGCAGGGACAGCCCTCAGCTCAGGAGAGCCACCCTCACACCAGCGGT 
CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
AGACAGATGCTTACAAAGTCCTGAAAACAGAAATGAGTCAGAGAGGAAGTGGACAGTGCGT TAT T 
CAT TACAGCAAAGGATT TC GT T GGCATCAAAATCTAAGT T TGT T T TACAAAGAT TGT T T T TAG TA 
CTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA 
AAAAAAAAAAAAAAAAAAAA 
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FIGURE 166 



MAAAPGLLFWLFVLGALWWPGQSD^ 

FKKGDDVYVYYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHIPADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKTDAYKVLKTEMSQRGSGQCVIHYSKGFRWHQNLSLFYKDCF 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 



m 
III 



N-glycosylation site. 



amino acids 294-298 




cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-34 



Tyrosine kinase phosphorylation site. 

amino acids 67-7 6 



N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 



Amidation site. 



amino acids 28-32 



AppJD=10063549 



Page 318 of 320 



FIGURE 167 



CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCTT GAGAATGAAGCAGAACAGAAGT TAATAGAGAGCATGTTGCAAAACCTGACAAAACCCGGG 
ACAGGGAT T TC TGAT GGT GAT T T CT GGATAGGGC TT TGGAGGAAT GGAGAT GGGCAAACATC TGG 
T GCC T GCCCAGATCT C TACCAGT GGT C TGAT GGAAGCAAT TCC CAGT ACCGAAAC T GGTACACAG 
ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
CTTGGGGGTCCCTACCTTTACCAGTGGAATGATGACAGGTGTAACATGAAGCACAATTATATTTG 
CAAGTAT GAACCAGAGAT TAATCCAACAGCCCCTGTAGAAAAGCCTTAT CT TACAAATCAACCAG 
GAGACACCCATCAGAATGTGGTTGTTACTGAAGCAGGTATAATTCCCAATCTAATTTATGTTGTT 
ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
G CATAAAAG TAAAGGAAGAACAAAAAC TAGT CCAAAC CAGT CTACACTGTGGAT T T CAAAGAGTA 
C CAGAAAAGAAAGTGGCATGGAAGT ATAA TAAC T CAT T GAC TT GG TT CCAGAAT T TTGTAAT TCT 
GGATC TGTATAAGGAATGGCAT CAGAACAATAGCT TGGAAT GGCT TGAAAT CACAAAGGATC TGC 
AAGAT GAAC TGTAAGC TCCCCCT TGAGGCAAATAT TAAAGTAAT T TT TATAT GTC TATTAT T TCA 
T T T AAAGAATATGCT GTGC TAATAAT GGAGT GAGACAT GC T TAT T TT GC TAAAGGATGCACCC AA 
AC T TCAAAC TTCAAGCAAAT GAAAT GGACAAT GCAGATAAAGT T GTT AT CAACACGTCGGGAGTA 
TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
TGTATATTGTATTGAAATTTACAGTGTGCAAAAGTATTTTACCTTTGCATAAGTGTTTGATAAAA 
H| ATGAACT GT TC TAATATT TATT T T TAT GG CAT C T CAT T T T TCAATACAT GC TC T T T TGAT TAAAG 

p| AAACT TATTAC TGTTGTCAACTGAATT CACACACACACAAATATAGTACCATAGAAAAAGT TTGT 

!!f! TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 

AATAAGAAGCTAT T T CAT TAAGT GT GATATAAAC C TCC TCAAACATT TT AC T TAGAGGCAAGGAT 
T GTC T AAT T TCAAT T GTGCAAGACAT GTGCC TTATAAT TAT TT T TAGCT TAAAAT TAAACAGAT T 
T TGTAATAATGTAACT TTGTTAATAGGTGCATAAACACTAATGCAGTCAAT TTGAACAAAAGAAG 
TGACATACACAATATAAATCATATGTCTTCACACGT TGCC TATATAATGAGAAGCAGCTCTCTGA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
ATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG 
ACTATATTAGTATACAAAGAGGTCATGTGGTTGAGACCAGGTGAATAGTCACTATCAGTGTGGAG 
ACAAGCACAGCACACAGACATT T T AGGAAGGAAAGGAACTACGAAAT CGT GTGAAAATGGGT TGG 
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAAAAAA&AAAAAAAAAAAAAAAAAA 
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FIGURE 168 



MSRWS LLLGAALLCGHGAFCRRWS GQKVC FADFKHPC YKMAY FHE LS SRVS FQEARLACE S E 
GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 
YRNWYTDEPSCGSEKCVVMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVEK 
P YLTNQPGDTHQNVWTEAG 1 1 PNL I YWI PT I PLLLL I L VAFGT CC FQMLHKS KGRTKTS PNQ 
STLWISKSTRKESGMEV 



111 



M 
fit 

CI 

ill 



Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 214-235 



III N-glycosylation sites. 

*S amino acids 8 6-89, 255-258 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 266-269 

N-myristoylation sites. 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 
145, 212-217 
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